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COOPERATION AMONG STATISTICAL AND OTHER SOCIETIES 
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Morris H. Hansen 
Bureau of the Census 


Delivered at the 120th Annual Meeting of the American Statistical Association, 
Stanford University, Stanford, California, August 25, 1960. 


INTRODUCTION 


HE American Statistical Association, organized over 120 years ago, is one 
es the oldest scientific societies in the United States. From the beginning its 
areas of interest have had considerable breadth, although the principal empha- 
sis in the original organization and activities was strongly on the development 
of factual information to guide governmental policy, programs, and adminis- 
tration in the general areas of economic and social development and public 
health. 

We are now witnessing unprecedented growth and expansion of statistical 
theory and applications accompanied by increased recognition of the role of 
statistics by workers in the various subject fields. Although applications and 
supporting theory are growing in almost every area of economic and social 
activity and of science, potential contributions are still generally underdevel- 
oped. Organized statistical groups, and especially the American Statistical 
Association, have an obligation to stimulate progress and to provide leader- 
ship, services, and channels of communication for facilitating effective devel- 
opment. This situation poses some questions and challenges that merit discus- 
sion by members of this Association and also by others interested in the devel- 
opment and application of statistics. 

The rapid expansion in the role of statistics in the past few decades has led 
to the organization of a number of new scientific societies with an important 
interest in statistical theory and methods or in the development, application, 
and improvement of statistics in various subject matter areas. Thus, the 1930 
to 1940 decade saw the organization of the Institute of Mathematical Statistics, 
the Econometric Society, the Population Association of America, and the 
Psychometric Society. Since World War II other societies added include the 
American Association for Public Opinion Research, the Operations Research 
Society of America, the Institute for Management Sciences, the Biometric 
Society, and the American Society for Quality Control. 


1 
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THE 1948 REORGANIZATION OF ASA 


The Presidents of the Association in 1944 and 1945, Helen Walker and 
Walter Shewhart, and others, gave considerable attention to expanding re- 
quirements, and to the role and organization of the Association, and proposed 
steps for adapting ASA to meet the new challenges and changing conditions. 

Walter Shewhart in his 1945 Presidential address discussed the widespread 
gains in statistics and summarized with the following remarks: 

“The statistical front is the whole field of science. . . . 

“To insure continued advancement on the statistical front, we need a strong pro- 
fessional organization backed by all statisticians working in the many different sub- 
ject matter fields: an organization whose purpose shall be to foster in the broadest 
manner the science of statistics and its applications; and to promote a unified, effective 
organized effort in advancing the common interests of all professional statisticians. 
“To this end, such a professional organization needs to conduct joint meetings and 
conferences with many organizations in the fields of pure, background, and applied 
research with whom official contact has never been made in the past; to establish 
joint committees with other scientific groups wherever necessary; to produce and 
distribute publications devoted to statistical methodology and statistical knowledge 
of common interest to all professional statisticians; to provide the public through 
the press, lecture platform, and radio with reliable information about the science of 
statistics and its contributions; and to stimulate statistical research and promote high 
professional standards in all statistical work. 

“I hope and believe that the American Statistical Association, with the cooperation 
of other organized societies interested in statistics, can be developed into a strong 
organization to serve the common interests of all professional statisticians in these 
groups.” ” 


Constitutional changes, introduced in 1948, established the present form of 
the ASA.’ These changes were directed at strengthening and encouraging the 
development of sections and local chapters within the ASA. They also at- 
tempted to strengthen communications and joint activities by establishing an 
arrangement whereby other societies with a strong interest in statistics would 
be encouraged to become associated with and participate in determining the 
policies and activities of the ASA. An associated society is represented on the 
ASA Council, and on the Publications Committee, and on the Editorial Board 
of the American Statistician, and thus has a voice in the ASA policies and pro- 
grams while retaining its status as an independent and autonomous society. 


PROGRESS IN SERVING WIDER ROLE OF STATISTICS 


Many of the efforts towards broadening the role of ASA have met with suc- 
cess, and some have not. The membership of the ASA has more than doubled 
in the last 15 years, with 3400 members at the end of 1945 and more than 7500 
today. The chapters have grown from 24 in 1945 to 41 today (with two new 
chapters this year), and their meetings and other activities represent an im- 
portant service to members and other groups. It seems to me that through the 
sections ASA has been especially successful in the effort to fulfill a broader 
central role. In 1945 we had a Section on Training of Statisticians, and a Bio- 
metrics Section. The Business and Economic Statistics Section was organized 
in 1950, the Section on Physical and Engineering Sciences in 1953, and the 





) Minor revisions occurred subsequently; the date of the p t Constitution is January 1954. 
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Social Statistics Sectic “1 1954. Under the general ASA program chairmanship, . 
the sections take mucn of the responsibility for planning the programs for the 
annual meeting. Moreover, they initiate and support training programs and 
special studies, organize various joint and special meetings, prepare publica- 
tions, and engage in other activities. They have a strong voice in the manage- 
ment of the Association through representation on the Council. Thus, the 
chapters and the sections broaden the activities of ASA to help serve the 
needs of various groups. 

The steps taken in 1948 and subsequently, to bring ASA into a position to 
serve as an association of statistical societies have not been successful. Only 
one group, the Eastern North American Region of the Biometric Society 
(ENAR), has become an associated society. I shall review, briefly, the rela- 
tionship of ASA to some of the other scientific societies which have a strong 
interest in statistical methods or applications. 


COOPERATION BETWEEN ASA AND OTHER SOCIETIES 


The Institute of Mathematical Statistics was formally organized in 1935, 
but was in a sense initiated in 1931, when the Annals of Mathematical Statistics 
was initiated. The Annals was given some initial support by the ASA, when 
it was recognized that the Journal of the American Statistical Association was 
not adequately serving the need for a journal reporting on advances in mathe- 
matical statistics and probably should not be modified so drastically as to 
serve this need. In the years since the Annals was initiated and the Institute 
of Mathematical Statistics organized, the Institute has grown and come to 
serve an essential and basic role in statistics. The IMS has chosen not to 
become an associated society of the ASA, but there has always been close 
cooperation between the two societies. Nearly two-thirds of the members of 
the Institute are also members of ASA, and about one-sixth of the much 
larger and more diverse membership of the ASA belongs to the Institute.? The 
ASA and IMS meet jointly at many of their respective annual meetings, ap- 
proximately in alternate years, and at various sectional, regional, and chapter 
meetings. Nominations to the ASA Board are usually made so that the Board 
will include some members of the IMS Council, in order to facilitate coopera- 
tion and communication. Currently, half of the ASA Council are members of 
the Institute. Frequently, special joint arrangements have been made to per- 
form functions of general interest to statisticians. The IMS regularly meets 
also, with the Mathematical Society (as well as other societies) and provides 
liaison between the mathematicians and the statisticians, and I understand 
that some IMS members feel that it would be unwise for the Institute to de- 
velop too close a tie either to statistics or mathematics because it might tend 
to weaken its relationship to the other. 

The Biometric Society was organized in 1949, as an international society, 
and the Biometrics Bulletin (initiated and’published at that time by the Bio- 
metrics Section of the ASA) was transferred to the Biometric Society to serve 
as its journal. Roughly half of the North American members of the Biometric 





2 These and subsequent estimates obtained primarily from Abe Rothman and Edgar M. Bisgyer, “Character- 
istics of the membership of the American Statistical Association, 1958,” The American Statistician, Vol. 13, No. 4, 
October 1959, pp. 31-7, supplemented by some samples from directories. 
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Society are also members of the ASA, and these constitute about a tenth of the 
ASA membership. As indicated earlier, the ENAR of the Biometric Society is 
an Associated Society, and cooperation has been close. However, the Bio- 
metric Society last year decided to terminate an arrangement whereby ASA 
members were granted a special rate of $4 on their journal, Biometrics. I believe 
that the Biometric Society concluded that most ASA subscribers would become 
full members at $7 if this special rate were not available, and that this has proved 
to be true. 

The American Society for Quality Control was organized in 1946, bringing 
together engineers, statisticians, engineering aides or quality control super- 
visors, and others interested in industrial quality control. I believe the pri- 
mary orientation of this society is toward engineering as its area of activity; 
however, quality control depends on statistics for development of its basic 
methods, and an important segment of the ASQC membership is primarily 
interested in statistics. About 8 per cent of the members of ASQC are also 
members of the ASA, and about 10 per cent of the ASA members belong to the 
ASQC. One highly effective step toward cooperation between the two organi- 
zations is the joint sponsorship of the new journal Technometrics, which is in its 
second year and is being well received. 

The Econometric Society meets jointly with the ASA at many annual and 
other meetings. Roughly one-fourth of its members are also ASA members, and 
cooperation has been long-standing. 

We have little experience with joint meetings or other joint efforts, on the 
other hand, with the Psychometric Society (50 to 60 per cent of its members 
are in ASA), or with the Operations Research Society of America (roughly 
one-fifth of its members are in ASA), or with The Institute of Management 
Sciences (roughly one-fifth of its members are in ASA), or with the American 
Association for Public Opinion Research (about one-fourth of its members are 
in the ASA) or with the Population Association of America (about half of its 
members are in ASA). 

The Society of Actuaries and the ASA have had relatively little direct com- 
munication, and only a very small common membership, although perhaps a 
fairly substantial area of common interest. 

Cooperation has been extensive and long standing with the American Eco- 
nomic Association, especially in the form of joint meetings. Nearly 20 per cent 
of the members of the AEA are members of the ASA, and these constitute a 
little more than 20 per cent of the membership of the ASA. We meet jointly 
with the economists along with a number of other Social Science societies in 
many of our annual and other meetings. 

The per cent of overlap with the American Marketing Association is of the 
order of 10 per cent, each way, and there is an extensive history of joint meet- 
ings. 


ROLE OF SECTIONS AND DEVELOPMENTAL COMMITTEES OF ASA 


In considering the cooperation of ASA with other societies it is worth par- 
ticular mention that the Biometrics Section of the ASA has continued, subse- 
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quent to the organization of the Biometrics Society. This Section has remained 
strong, healthy, and effective and has been the primary instrument of coopera- 
tion with the Biometric Society. William G. Cochran, in his ASA presidential 
address in 1953, suggested some good reasons for continuation of the section as 
a meeting ground for biometricians with other statisticians in the setting of a 
statistical society, and for the Biometric Society as a primary meeting ground 
for biometricians with biologists. This may have established a useful pattern. 

The same approach has, to some extent, been followed in other fields, with 
the Business and Economic Statistics Section, the Social Statistics Section, and 
the Physical and Engineering Sciences Section representing various subject 
interests in the ASA development and programs, and serving as liaison groups 
with certain counterpart societies outside ASA. 

There are a number of other fields where committees have been or are in 
process of being appointed to guide ASA in the determination of its appropri- 
ate role, to stimulate work within ASA, and to develop joint activities with 
other societies. Some of these activities may lead to new sections. 

Considerable progress has been made in the application of statistics in many 
new fields, and accounting is one where recognition of the possibilities and 
work on advancing methods and applications are in progress. We have recently 
appointed a committee in this area. Joint work with the accountants until 
recently has been very limited—perhaps this is the beginning of a more exten- 
sive development. 

Recently, also, ASA has appointed a committee to work with the Meteoro- 
logical Society Committee on Agricultural Meteorology, and, in addition, to 
explore generally our appropriate role and activities in meteorology. Perhaps 
this committee, or possibly another one, should consider also, the role of sta- 
tistics in the broader area of what might be called the atmospheric and space 
sciences. 

We are currently working on the establishment of committees in three other 
areas: Management or operations research, marketing, and computers. These 
committees will explore the statistical role in each area, examine the needs, 
and stimulate appropriate activities. Perhaps other areas need attention. The 
Council, the officers, or the Committee on Committees would welcome ideas 
and suggestions. : 

It might be worthwhile, for example, to establish a committee on applied 
statistics, without regard to the particular subject field of application. The 
basic principles and to a very substantial degree the methods of statistics are 
general in their applicability, whether in various fields of social science, or in 
the natural or physical sciences. Many statisticians are highly effective as 
specialists or consultants in developing and applying statistical methods to 
problems without specialization as to subject matter. They are able to per- 
ceive the essential nature of problems involving uncertainty in a wide variety 
of situations and to develop and apply statistical and other quantitative meth- 
ods to aid in their solution. The contributions of the general statistician, who 
can take a scientific approach to any one of a wide variety of problems involv- 
ing the collection and use of data to guide decisions, as distinguished from de- 
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veloping theory for its own sake, or from being a statistician-specialist in a 
subject-matter field, seems to be inadequately recognized in the ASA struc- 
ture at the present time. 


NEED FOR COOPERATION AMONG STATISTICAL SOCIETIES 


I mentioned earlier the recommendations by Walter Shewhart and others of 
the need for a more unified organization and approach in promoting the fuller 
utilization and development of statistics, and the steps ASA took in an effort 
to serve as a unifying and central statistical organization. Certainly these 
steps have led to progress in many important respects, as is indicated by the 
diversity and progress of sectional activity just described. However, they have 
resulted in little progress in bringing together a formal association of societies. 

Rensis Likert, president of the ASA last year, recognized the desirability of 
clarification of goals and of what should be done. He discussed with various 
groups the possibility of increased cooperation among societies with a strong 
interest in statistics. After obtaining an indication of interest and approval 
from the ASA Council, and from a number of other societies, he proposed or- 
ganized discussions on this type of cooperation extending over a considerable 
period of time. He also laid the groundwork for a modest grant of funds from 
the Rockefeller Foundation to support exploratory discussions. 

This year we have completed the formal arrangements, have received a 
grant, and with the IMS and ENAR of the Biometric Society as joint hosts, 
we have arranged for an initial meeting next month among representatives of 


a number of societies with a strong interest in statistics. We also appointed an 
ASA committee, to consider implications and alternatives, to bring together 
and summarize some of the ideas of a wide representation of our members on 
this subject, and to advise the Board of Directors and the Council. 


PURPOSES OF FURTHER COOPERATION 


I should now like to summarize some questions and comment on the pur- 
poses, methods, and possible results that might come from more effective co- 
operation, as a background for further consideration and discussion among 
ASA members and for our discussions with other societies. 

(1) Is there a need for a coordinating organization or approach among pri- 
marily statistical societies, beyond what is now accomplished by the coopera- 
tion among ASA, IMS, the Biometric Society, and others? What purposes 
are to be served that are not served just as well by the existing methods of 
achieving cooperation? 

In my opinion there are few if any specific programs that could not be ac- 
complished by the cooperation of the type that has been existing; moreover, 
exploration by the ASA committee and others has indicated that there is a con- 
siderable but not a predominant body of opinion that this is a sufficient and 
adequate approach. However, the fact that needed programs might be accom- 
plished does not mean that they will be accomplished, short of more sustained 





* This initial meeting was held on September 16, 17 and 18, and a strong interest was expressed i increased co- 
operation by all representatives present. An intersociety committee was appointed to carry on further explorations 
and develop a proposal. 
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strong leadership than can now be marshalled. There is rather widespread 
opinion that progress is hindered by lack of a more coordinated organization 
or more effective cooperation. Some of the purposes to be accomplished by 
greater cooperation that have been discussed include the following: 


(a) The Federal Government in its scientific, economic, international, personnel, and 
other programs, looks to scientific societies in various fields for effective advice and 
guidance. Grants of funds for research and feilowships are made on the basis of such 
advice and it is available in most scientific areas. For example, the National Acad- 
emy of Sciences calls on scientific organizations for advice through national com- 
mittees representing scientific disciplines. There is no such national committee for 
statistics. Again, the U. 8. Civil Service Commission has requested advice on the 
classification and recruitment of statisticians. To some extent ASA and IMS are 
now providing such advice, but it is not unified and many believe it is not strong, 
or even heard in many instances where it should be. 

Sustained and strong leadership is needed if we are to develop desired projects, pro- 
mote new activities, or get more general recognition of the nature and role of sta- 
tistics, and consequently fuller utilization in the widespread areas in which statistics 
can make basic contributions. The breadth of use of statistics is growing extensively, 
but still is far short of what it could be as a fundamental tool of science, of adminis- 
tration, and of policy guidance. Stronger leadership and a stronger voice should help 
in achieving these aims. I have felt, for example, in my experience as President of 
the ASA this year, that there are many wavs in which advances should be made, 
but that have been seriously handicapped vy inadequate resources. Progress has 
been made, but the steps taken have been inadequate—in spite of—in my opinion— 
rather considerable efforts within the bounds of what can reasonably be dene with 
limited available time of the officers and part-time executive direction. It seems 
to me that a full-time executive director and fuller staff support could add greatly 
to our ability to develop and progress on worthwhile activities, and that prospects 
for providing these could be greatly enhanced with a stronger central statistical 
organization. A more unified approach can more readily support the kind of staff 
and leadership that is needed to conceive, explore, promote, and carry through the 
wide variety of activities on which we should be engaged. 
Communication among producers of theory, applied statisticians, and consumers 
of statistical consultation and statistics in various subject areas might be strength- 
ened by a stronger coordinating organization. Perhaps these could be helped not 
only by joint meetings but by more survey or expository articles, by stronger efforts 
to bring producer and consumer together, by the fuller development of training 
materials, and by other devices that need to be recognized and promoted. 
Some of the steps taken to achieve improved standards of statistical work should 
involve cooperative effort. There are some who believe that the widespread adoption 
of standard methods, or of standards for certifying statisticians might handicap 
progress rather than promote it. But even if this is assumed there are still many 
areas in which more work on standards might be of real assistance, including the 
general areas of terminology, notation, graphic and tabular presentation, and avail- 
ability to review, evaluate, and promote improved statistical work as ac to im- 
portant problems in which there is a strong public interest. 
Many specialized services can be developed as an aid to the more effective accom- 
plishment of the programs of the cooperating societies. Such services are not essen- 
tial to but may be facilitated by and may be a highly useful outcome of greater 
cooperation. On the basis of the experience of other societies it appears that some of 
the service functions developed would be adopted by some but not all participating 
societies. Among the services that might be developed are the following: 

(i) A central membership directory. 

(ii) Management and editorial services for scientific journals and other publica- 

tions. 
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(iii) Public relations services. 

(iv) Staff work to guide the preparation for, and organization of, conferences and 
meetings. 

(v) Central file maintenance for membership subscriptions, and related opera- 
tions, including addressing and mailing. 
A placement service. 
A visiting speakers service to aid universities, local groups, and others to get 
access to qualified speakers on statistics. 
Special bibliographic and book review services, bringing together indexes, 
abstracts, guides, etc. 
Exchange arrangements on subscriptions to journals and other publications 
of the societies that will reduce costs and encourage more widespread reading 
and communication across the applications and developments in different 
subject areas. 

(x) Numerous other management and staff services. 


(2) Societies in other disciplines generally have faced this same question, 
as their roles have been widened and activities specialized in a diversity of 
areas. As the biologists, psychologists, physicists, mathematicians, chemists, 
and others have faced the situation of increased diversity of interests they have 
provided for a stronger coordinating or central organization. There seems to 
be rather widespread acceptance of the view that increased effectiveness has 
been achieved as a result. 

(3) A basic principle that seems essential to progress in this area is recog- 
nition of the autonomous role of the various societies—that whatever coopera- 
tion is achieved may create stronger channels of communication, the stimula- 
tion of more active development, a central voice, and central services, but can- 
not in any way alter the independence of action, prestige, and recognition of 
the individual groups. 


ORGANIZATION FOR IMPROVED COOPERATION 


Some other questions need consideration in discussing increased cooperation 
among statistical societies, and have received some attention. Some of these 
include the following: 

(1) Should a relatively weak central organization or a strong one be created? 
We might have a committee or association which exists and meets, but with 
little staff, to serve as a communication and clearance medium among the 
societies, and as a channel for some types of communication between the soci- 
eties and outside activities. Alternatively, we might have a strongly staffed 
and aggressive group, with strong professional and administrative leadership, 
and with resources in time and money to carry out programs, obtain grants of 
funds, encourage and promote new activities, etc. 

(2) To what extent should such a central organization be a n«tional or an 
international organization? As I understand it, ASA today is broader than a 
United States organization, perhaps is better regarded as a North American 
organization, serving in particular statisticians in the United States and Can- 
ada, but with a considerable number of members throughout the world. Is this 
the character that should be retained in a proposed central organization? It 
might be national or might have even a broader international role. In the latter 
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event, could it also have a recognized authority to appoint national commit- 
tees or groups to serve national purposes? 

(3) Should a central or coordinating organization or group be only an or- 
ganization of societies, an organization of individual people as members, or 
both? This, I suppose, should depend on the conclusion, to be reached after 
further discussion, regarding which services are to be provided. It appears that 
an organization of societies can be most effective in accomplishing some of the 
functions that have been discussed. Perhaps it will be stronger, financially and 
otherwise, if it provides services both to societies and to individual members. 
This latter pattern has been followed in the case of a number of other groups, 
but organizations of societies have also been followed. At least this is an area 
for consideration and further discussion. 

(4) Should the instrument for achieving a coordinated society be the Amer- 
ican Statistical Association, or should a new organization be created, such as a 
Committee on Statistics? 

ASA now has a central role, and has a staff of some strength, although it is 
inadequately staffed and supported for carrying the role that seems needed. 
In fact, ASA is to a considerable extent providing the needed leadership, gen- 
eral representation, and stimulation of cooperation. Also, we have seen that 
ASA membership has a wide diversity of interests and represents statisticians 
working in the various areas of statistical development and application. More- 
over, statisticians with a primary interest in a specialized field seem particu- 
larly likely to be members of ASA if they also have a broader interest in sta- 
tistical developments and applications. 

An effective organization, supported by strong staff and other resources, may 
be expensive and difficult to accomplish if we continue to use and to strengthen 
the ASA organization and also attempt to create another one. An attempt to 
build another staff and organization—in addition to that of ASA—would 
seem doomed to ineffectiveness unless it took over ASA functions, resources, 
and staff. ASA is the organization in which statisticians communicate pri- 
marily with statisticians, rather than with mathematicians or subject-matter 
specialists. In my opinion, a step that weakens ASA, unless it at the same time 
replaces it with a clearly strengthened general statistical organization, would 
serve to decrease rather than increase the effective support of statistics. More- 
over, it would appear that a stronger organization could most easily be achieved 
by appropriately modifying ASA in whatever respects seem desirable. Alter- 
natively, perhaps an overall coordinating organization could be created to 
to serve as a policy board, depending upon ASA staffing and instruments to 
carry out its roles. 

It seems to me that an approach that involves either adapting ASA or 
utilizing ASA staff and other resources to serve the broader functions, holds 
considerable promise and should receive serious consideration and attention. 


NEED FOR CONTINUED EFFORTS IN ASA 


Finally, I should like to make a few remarks about the program of ASA and 
its relationship to future developments of a central or coordinating organiza- 
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tion, whatever form it might take inside or outside the structure of the Amer- 
ican Statistical Association. 

There is sometimes a tendency already observable in some of our own actions 
and programs (I suppose this may be true in other societies, also), to put off 
steps that we might take provided such steps might be affected by or have 
some effect on the discussions of a coordinated approach. There is a tendency 
to postpone and await developments of the contemplated discussions. I think 
that such procrastination can be only a setback to statistics and to the areas of 
endeavor in which statistics is increasingly of vital importance, and that we 
should proceed aggressively and as effectively as we know how in the various 
programs of this Association, by taking various steps such as those mentioned 
earlier, as they seem to be needed. At the same time, it seems to me we should 
cooperate and participate to the fullest in discussions among statistical soci- 
eties on what needs to be done and what steps should be taken in arriving at 
a stronger and more coordinated effort in the area of statistics. 
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The test, proposed by Moore and Wallis [23], of the existence of cor- 
relation between the movements in two time series (eliminating at least 
the primary effects of trends) was justified by them in “the special case 
of randomly arranged [signs of the] first differences” in both series. 
Moore and Wallis also noted that the signs of the first differences in a 
time series will have a negative serial correlation if the time series is a 
purely random process (or if it is a particular kind of generalization 
of a purely random process). It is therefore necessary to modify the 
Moore-Wallis test when applying it to such time series in order to 
take into account this serial correlation. In the present article, a simple 
modified form of the Moore-Wallis test is suggested, which can be 
applied to such time series. Somewhat similar modifications are also 
suggested for testing the existence of correlation between some more 
general kinds of time series. The importance of taking into account the 
effects of trends in the time series and the effects of the serial correla- 
tions (between the signs of the first differences) is illustrated by means 
of two numerical examples. Finally, the study of the movements in 
three or more time series is briefly discussed. 


1, INTRODUCTION 


et { Xo, Xi,--+, Xn} =X and {Yo, Yi, ---,Y 2} =Y be two different ob- 
[ served time series of equal duration. Let Xi4:—-X;=W; and Yiyi— Y;=Z; 


fori=0, 1, - + - ,n’ where n’=n—1. For the sake of simplicity, we shall assume 
throughout that the W; and Z; have continuous distributions. Let U;=1 if 
W;>0, and U;=0 if W;<0. Let V;=1 if Z;>0, and V;=0 if Z;<0. The joint 
distribution of the n pairs of observations (U;, V;) can be summarized in a 2X2 
cross classification table. G. H. Moore and W. A. Wallis [23] have suggested 
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who knew him felt equally the impact of a profoundly gifted personality. He was an intensely spirited, alive and 
generous human being. We have suffered a great loss, but are grateful that he was with us, ¢ ven if for so short a time. 
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that a test of the existence of correlation between the movements in the two 
time series, which eliminates at least the primary effects of trends, can be based 
on the usual test of independence applied to the 2X2 cross classification table. 
They state that “although such a test would be of great value, the sampling dis- 
tribution [of the test statistic] is known only for the case of randomly arranged 
[signs of the] first differences.” They also mention that if this test, which is 
justified by them when the signs of the first differences are randomly arranged, 
leads to acceptance of the hypothesis that correlation is absent, then it surely 
would lead to acceptance of this hypothesis “if the proper null hypothesis of 
random observations were used; but nothing more can be said definitely until 
the sampling distribution [of the test statistic] appropriate to the assumption 
of random observations is known.” This sampling distribution will be derived 
herein for observed time series of long duration. This result leads to a simple 
modification of the test presented in reference [23]. 

The joint distribution of the n pairs of observations (U;, V;) can be summa- 
rized in the following 2X2 cross classification table. 


V 




















This table gives the number, a, of times when both series X and Y increased 
(i.e., the number of positive comovements); the number, d, of times when X 
and Y decreased (i.e., the number of negative comovements) ; the number, b, of 
times when X increased and Y decreased; the number, c, of times when X de- 
creased and Y increased. The total number of comovements is a+d; the total 
number of contramovements is b+c; the number of positive movements (i.e., 
increases) in series X is a+b; the number of positive movements in series Y is 
a+c. The usual test of independence applied to the 2X2 table (see, e.g., [25], 
pp. 65-72), when appropriate, would indicate whether a differs significantly 
from its estimated expected value (a+b)(a+c)/n assuming U and V are inde- 
pendent; whether a+d differs significantly from its estimated expected value 


[((a + b)(a +c) + (6+ d)(c+d)]/n; 


whether (a+d)—(b+c) differs significantly from its estimated expected value 
[(a+b) —(c+d) |[(a+c) —(b+d) |/n; etc. (In passing we note that 


[a — (a+ b)(a+c)/n] = {a +d) — [@+b)(a+c) + (6+a)(c+d)]/n}/2 
= {[@+ 4) — 6+e¢)] —[@+b)-€+0H][@+o —-6+4]/n}/4 
= (ad — be)/n). 


Since (a+b)/n measures an aspect of the “trend” in X and (a+c)/n measures 
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an aspect of the “trend” in Y, this test when appropriate would indicate whether 
the number of comovements differs significantly from what could be expected, 
taking into account the trends in X and Y; i.e., it would test whether the ob- 
served number of comovements can be “explained” by the trends in X and Y 
alone. 

When both series X and Y are processes with independent stationary incre- 
ments [7, pp. 96-8], the usual test of independence applied to the 2X2 table 
given above will be appropriate. (For such series, the signs of the first differ- 
ences will be mutually independent identically distributed random variables.) 
This test can also be justified under somewhat more general conditions, which 
will be discussed in Section 2 herein. When both series X and Y are purely ran- 
dom processes (see [34]), the usual test, as was noted in reference [23], cannot 
be justified. Simple modifications of this test will be presented in Sections 3 and 
4, which will be appropriate when the series are purely random processes or 
when they are particular kinds of generalizations of such processes. 

The usual test of independence for a 2X2 table (see, e.g., [25]) is a two-tailed 
test in the sense that it tests whether A= [a—(a+b)(a+c)/n] is significantly 
large in absolute value. This test of the null hypothesis that the expected value 
of A is zero can be modified to obtain a one-tailed test against the alternate hy- 
pothesis that the expected value of A is positive and another one-tailed test 
against the alternate hypothesis that it is negative (see [25, p. 72]). Similarly, 
the results presented herein can be used to obtain either two-tailed or one-tailed 
tests. A one-tailed test will indicate whether there are a significantly large 
number of comovements (taking into account the trends) ; a different one-tailed 
test will indicate whether there are a significantly small number of comovements. 

The 2X2 cross classification table given above and in reference [23] is based 
on a dichotomous classification of the W; indicating whether the W; are posi- 
tive or negative, and a similar classification of the Z;. For the sake of simplicity, 
we shall confine our attention in the main to situations where this method of 
classification is used. Some of the results presented herein can be modified in 
a straightforward fashion for situations where other fixed methods of classifica- 
tion of the W; (and Z;), into two or more classes based on the magnitudes of the 
W; (and Z,), are used. In addition, various other approaches to the analysis of 
cross classification tables could be applied here to the analysis of the observed 
joint distribution based on the classification of the n pairs (W;, Z;). For ex- 
ample, besides testing the null hypothesis that the W; and Z; classifications are 
statistically independent, it sometimes may be worthwhile to measure the ex- 
tent of the observed association between these classifications (see, e.g., [13] 
and the literature cited there). The particular approach that will be appropriate 
will depend on the purposes of the particular investigation at hand. 

It should be mentioned that the term “comovement” used here is adopted 
from earlier work by M. Friedman [9]. It is also utilized in reference [14], 
where there is some discussion of the information obtained by an examination 
of comovements as contrasted with the information obtained by an examina- 
tion of correlations between time series. 

An index of association between two time series based on the observed propor- 
tion of comovements to total movements was proposed by G. T. Fechner [8] 
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and others (see, e.g., [26], [29], and related comments in [19]). A test based on 
the observed number of comovements was suggested by A. Stuart [30]. These 
earlier writers did not take into account simultaneously both (a) the time trends 
that may exist in the two series (except insofar as the signs of first differences 
were used) and (b) the serial correlation between the signs of the first differences 
when the series are purely random processes or are particular kinds of gen- 
eralizations of such processes. (The Stuart test [30] takes into account (b) but 
not (a), while the Moore-Wallis test [23] takes into account (a) but not (b).) 

For further discussion and explication of the problem of testing the signifi- 
cance of correlations between time series, the reader is referred to references 
{1], [15], [20], [21], [24], [27], and to the literature cited there. 


2. TIME SERIES WITH INDEPENDENT STATIONARY INCREMENTS 


We first consider the situation where time series X and Y are processes with 
independent stationary increments (see reference [7], pp. 96-8). Although the 
term “process with independent increments” usually refers to the continuous 
time parameter case, it has been defined also for the discrete time parameter 
case, which is the case considered here. Processes of this kind have sometimes 
been called “additive processes” or “random walks” (see Bartlett [4]). In this 
situation, Wo, W:, -- +, W.» are mutually independent identically distributed 
random variables (m.i.i.d.r.v.) and so are Zo, Z1, ++ +, Zn; thus Uo, Ui, ---, 
U, are m.i.i.d.r.v. and so are Vo, Vi, ---, Va. If W; and Z; are statistically 
independent, then U; and V; will also be statistically independent, and the 
n pairs of observations (U;, V;) can be interpreted in this situation as a 
random sample from a multinomial population described by a 2X2 cross classi- 
fication table where the classifications are independent. Thus, in order to test 
the null hypothesis that W; and Z; are statistically independent, the usual test 
of independence in a 2X2 table can be applied in this situation to the table 
describing the n pairs of observations (U;, V,). 

The preceding justification for the use of the usual test of independence in 
the 2X2 table is based on the fact that the Uo, U1, - ++, Uw are m.i.i.d.r.v. 
and so are the Vo, Vi, ---, Va; but it is actually not necessary that the Wo, 
Wi, -- +>, Wa be m.i.i.d.r.v. nor that the Zo, Z:, - - - , Za: be. Thus, even if the 
X and Y time series are not processes with independent stationary increments, 
the approach described above can be justified as long as the derived random 
variables Uo, Ui, - - - , Ux are m.i.i.d.r.v. and so are the Vo, Vi, -- -, Va. 

If the Uo, Ui, - - + , Uq are m.i.i.d.r.v. and so are the Vo, Vi, -- +, Va, then 
the usual “exact” test of independence in a 2X2 table applied to the n pairs 
(U;, V4) will be optimum as compared with other tests, based on the (U;, V;), 
of the hypothesis that the U; and V; are independent (see e.g., [33] for one- 
tailed tests, and [32]). However, if additional assumptions or restrictions are 
made about the time series X and Y, there may be preferable procedures; e.g., if 
the Wo, Wi, - - - , W. are m.i.i.d. normal r.v. and so are the Zo, Z1, - + +, Zn’, 
then the usual test of the null hypothesis that the correlation between two nor- 
mal variates is zero applied to the n pairs (W;, Z;) will be optimum as compared 
with other tests, based on the (W;, Z;), of the hypothesis that the W; and Z; are 
independent (see [2]). The particular statistical method that should be applied 
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will depend very much on the structure of the time series. In the following sec- 
tions, statistical methods will be presented for time series that are different 
from those discussed in the present section. For a discussion of the statistical 
analysis of the relationships between time series that are quite different from 
those discussed in this paper, the reader is referred to some of the literature cited 
earlier herein. 


3. PURELY RANDOM TIME SERIES AND A GENERALIZATION THEREOF 


Let us consider the special case where Xo, Xi, - - - , X, are m.i.i.d.r.v. and so 
are the Yo, ¥:,+-, Yn. Im this case, Pr{ U;=1} =Pr{V,=1} =4 for i=0, 
1,---+, mn’. If U; and V; are independent, then Pr{U;=V,=1} =}. For the 
sake of simplicity, we shall assume throughout this section that the number n is 
even. We shall make use of the basic fact that in the situation considered here 
the Uo, Us, Usy- ++, Una are m.iid.r.v., the Ui, U3, Us,---, Un are 
m.i.i.d.r.v., the Vo, V2, Vs, -- +, Var are m.ii.d.r.v., and so are the Vi, V3, 
Vs, -- +, Va. The total number of positive comovements between series X and 
Y is 


a= ‘> UW, 


i=( 


which is the sum of 


m!’ m’ 
a, = Zz UsiV 2; and a2 = » UsigiV 241, 


i=0 t=0 


where m’=m—1 and m=n/2. The a; and az are binomial variates, and the co- 
variance between them is 


C{ai, a2} = [(2)*? — ()*]m — DP = — (n — 1)(5/144) 


(see Appendix). The usual large sample theory for binomial variates (see, e.g., 
[22, p. 139]) indicates that a; and a, (when properly normalized) will be ap- 
proximately normally distributed. The sum a of a; and a; will be approximately 
normally distributed if the joint distribution of a; and a2 is approximately nor- 
mal, which can be proved using a central limit theorem for dependent random 
variables [17, pp. 776-8]. (The fact that the distribution of a is approximately 
normal can be seen to follow directly from this central limit theorem, but we 
have introduced the a; and a; to show how the distribution of a is related to the 
usual theory for binomial variates.) 

The formula for C{ a, a2} given above can be used to compute an exact ex- 
pression for the variance of a; namely, 


o*{a} = (3n/16) — (n — 1)(5/72) = (17n/144) + (5/72). 


In the situation considered in the preceding section where Uo, Ui, - - + , Uy are 
m.i.i.d.r.v. and so are the V1, V2, -- +, Va, assuming that Pr{ U;=1} =Pr{ V; 
=1} =}, the variance of a can be seen to be equal to 3n/16. Thus, the variance 
of a in the present situation (due to the negative covariance between a; and az) 
is less than the variance in the preceding situation. The decrease in the variance 
equals (n—1)(5/72). 
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Since the relative decrease in variance can be as large as 37 per cent, it is 
obviously important, when dealing with the situation considered here, to use the 
appropriate variance formula; i.e., to “correct” the variance formula given for 
the situation considered in the preceding section in order to take into account 
the negative serial correlation between U;V; and U i: Vig (¢=0,1, «+ + ,n’—1). 

The approach presented above can be used to obtain the approximate dis- 
tributions of the number d of negative comovements, the number b+c of con- 
tramovements, etc. It can also be used to show that the approximate distribu- 
tion of the number of positive movements in a time series, which was presented 
in references [20], [23] and [30], is in fact the approximate distribution of the 
sum of two correlated binomial variates. 

Let us now consider the more general situation where the Xo, X:,---, X, 
are mutually independent but need not be identically distributed, and the same 
is true for the Yo, Yi, - - - , Y.. In this case, we shall assume that 


Pr{ U; = 1} = Pi, Pr{V; = 1} = T;, Pr{U; = Ui = 1} = gi 


and Pr{Vi=Visa=1} =h; do not depend on i; ie., pi=p, ri=r, gi=g, and 
h;=h, fori=0, 1, - - - . (The condition that the g; and the A; are constant can be 
weakened somewhat in order to obtain some more general results, but this gen- 
eralization will not be discussed here (see [17, p. 774]).) If U; and V; are inde- 
pendent then Pr{ U;= V,;=1} =pr. Using the same approach as used earlier in 
this section, we find that a; and a, are binomial variates, and that 


E{a,} = E{a:} = mpr, E{a} = npr, o*{a:} = o?{a2} = mpr(1 — pr), 
C{a,a2} = [gh — (pr)*](n — 1), o*{a} = npr(l — pr) + 2(m — 1)[gh — (pr)?]. 


(A proof of the fact that C{a:, a2} <0 is given in the Appendix.) Thus, in the 
situation considered here, the variance of the number a of positive comovements 
is approximately n{ pr+2gh—3(pr)?} ; while in the situation considered in the 
preceding section, the variance of a can be seen to be npr(1— pr). It is possible 
to show that the distribution of (a:, a2) is approximately normal, that the dis- 
tribution of a is approximately normal, that the variance of a in the present 
situation is less than the variance in the situation discussed in the preceding 
section, and that the decrease in variance equals 2| Cha, a} | : 

We shall now determine the distribution of the difference between a and 
(a+b)(a+c)/n, its estimated expected value obtained by the usual 2X2 table 
approach, where 


a+b=>U;, at+tc=)> Vj. 


t=0 i= 


We have that 
E\a + b} = np, 
Eja+c} =nr, o%{a+b} = np(l — p) + 2(n — 1g — p?), 
o{a+c} nr(1 — r) + 2(n — 1)(h — 7), Cia +b,a+ c} = 0, 
and 


o*{(a + b)(a+c)} = (mr)*o*{a + b} + (np)*e*{a +c} + o%{a+b}o*{a+ec}. 
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Thus, the variance of A = { (a+b)(a+c) /n} is approximately equal to 
n[r*(p + 2g — 3p*) + p*(r + 2h — 3r)], 


which also can be seen to be an approximate formula for the covariance be- 
tween a and A. Therefore, the variance of a—A is approximately equal to 


n[pr + 2gh — 3(pr)* — r(p + 2g — 3p*) — p*(r + 2h — 3r°)] 
= n[pr(1 — p)(1 — v) + 2g — p*)(h — 1°). 


The distribution of a— A is approximately normal, as can be seen by a direct 
application of a central limit theorem for dependent random variables [17] or 
by the following approach. Let 


Then 





a— A =a; — Ai + a; — A2+ (Ai + Aa — A), 





where 


Ait A:-A= | = (Vein — vs) | | 7. (Vou — v2) | 2/m. 
i=0 i=0 

Since the variance of the random variable (A:+A:—A)/+/n will approach zero, 
this variable will approach zero in probability. Therefore, a—A is essentially 
equal to the sum of a:—A, and a:—Asz (see [6, p. 254]). By the usual large 
sample theory for 2X2 cross classification tables, we see that the distribution 
of a,— A, is approximately normal and so is the distribution of ag— A». Their 
sum can also be seen to be approximately normal with (asymptotic) variance 
equal to the sum of npr(1—p)(1—r) and twice the quantity n(g—p’)(A—r’), 
which is the (asymptotic) covariance between (a;—A:) and (a,—Az:). Since 
this covariance is positive (see Appendix) the asymptotic variance of a—A is 
found to be larger than npr(1—p)(1—r), which is the standard variance formula 
relating to the corresponding quantity in the usual 2X2 table (see [25, p. 71]). 

To illustrate the relative effect of the covariance between (a,;—A1) and (a2 
— A»), which is in fact the relative effect of the serial correlation in the U series 
and in the V series, on the variance formula for a— A, we shall for the moment 
consider again the special case where the X’s are m.i.i.d.r.v. and so are the Y’s. 
In this special case, we see that npr(1—p)(1—r) =n/16, that 2n(g—p*)(h—r?*) 
=n/72, and that the relative effect of the “correction for the serial correlation” 
is approximately 16/72 =2/9. Since this relative effect can be as large as 22 per 
cent, it is important, when dealing with the situation considered here, to use the 
appropriate variance formula; i.e., to “correct” the variance formula obtained 
in the usual analysis relating to 22 tables (e.g., [25, p. 71]) in order to take 
into account the serial correlation in the U series and in the V series. 

The quantity g—p? is the covariance between U; and Uj41, and h—r? is the 
covariance between V; and V ;,;. These quantities can be estimated consistently 
by 
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n’—1 n’—1 


e= >> UUins/n’ —[(a+b)/n]}? and f= >) VVigi/n’ — [(a + 0)/n}?, 


i=0 iz=0 
respectively. Thus 
(a — A)/Vn[(a + b)(a + c)(b + d)(c + d)/n* + 2¢f] 


will be approximately normally distributed with zero mean and unit variance, 
and the distribution of 


(ad — be)?n/[(a + b)(a + c)(b + c)(b +d) + 2n‘ef| 


will be approximately x? with one degree of freedom (see [25, p. 71]). By apply- 
ing the correction 2n‘ef to the usual x? statistic computed for the 22 table 
describing the joint distribution of the n pairs (U;, V;), we can test the null 
hypothesis that U,; and V; are independent when dealing with time series of the 
kind discussed in this section. 

It is interesting to note that e and f are closely related to the statistics com- 
puted in the usual 2X2 cross classification table analysis applied to the pairs 
(U;, Uiss) and to the pairs (V;, Vis1), which are appropriate for testing ran- 
domness in the U series and in the V series, respectively (see [10], [11]). For 
further details about this relationship and for some generalizations of the test 
suggested in this section, see [12]. 





4. A MORE GENERAL KIND OF TIME SERIES 


In the preceding section, we noticed that when the Xo, Xi, ---, X, were 
m.i.i.d.r.v. then the derived random variables Uo, Ui,---, Ux were not 
m.i.i.d.r.v., but the variables Uo, Us, Us, - + - , Una were (when n was taken 
as even), and so were the U;, U3, Us, -- -, Ux. In this section, we deal with 
more general time series where the Uo, Us, Us, - - - , Un. (or some other speci- 
fied subsequence of the U’s) are m.i.i.d.r.v., and the corresponding subsequence 
of the V’s are also m.i.i.d.r.v. Writing n’—-1=n*, we see that the n/2 =m pairs 
of observations (Uo, Vo), (U2, V2), -- ~, (Un*, Va*) can be interpreted as a 
random sample from a multinomial population described by a 2X2 cross classi- 
fication table; if the U2; and V2; are statistically independent, then the popula- 
tion table will exhibit complete independence. Thus, in order to test the null 
hypothesis that the U2, and V2; are statistically independent, the usual pro- 
cedures for testing independence in a 2X2 table can be applied to the table 
describing the sample of m pairs (U2;, V2;), 7=0, 1, -- -, m’. Thus, if the X 
and Y time series are of the more general kind described in this paragraph, then 
it is possible to modify the cross classification procedure presented in Section 2 
in order to obtain a similar method, applied to (approximately) half as many 
pairs (U;, V;), for studying an aspect of the relationship between the X and Y 
series. 

In general, if time series X is such that within the sequence of derived U’s 
there is a subsequence U,,, U;,, +--+ of m.i.i.d.r.v. and if time series Y is such 
that V,,, V.,--- are also m.i.id.r.v., then it is possible to apply the cross 
classification procedure presented earlier to the pairs (U;,, V;,) in order to study 
an aspect of the relationship between the X and Y series. (For example, if time 
series X is a process of moving averages of the form 
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k—-1 
Xi = Do aks 

j=0 
where the &; are (unobserved) m.i.i.d.r.v. ((=0, +1, +2,---), then the Wo, 
Wiss, Worse, Waeys,°*- are m.iid.r.v. (see [4], [34]), and so are the Us, 
Unis, Uoese, Users, - + + . Thus, if both times series X and Y are of this general 
form, then it is possible to apply the cross classification procedure to the pairs 
of observations (U;,, Vi,), where tj;=(*+1)j for j=0, 1, - - - .) Since it is possi- 
ble to test the null hypothesis of independence in a 2X2 cross classification 
table even when the observed frequencies are small (see, e.g., Pearson and 
Hartley [25], p. 65-71), it is not necessary that there be a large number of pairs 
of observations (U;,, V.;); but, of course, it is preferable to have more than less. 
When there are a large number of pairs (U;,, V:;), it will not be necessary to 
apply the “exact” test of significance to the 2X2 table, since the usual approxi- 
mate x? method will be satisfactory (see, e.g., [25], pp. 71-2). 

For some time series, it may be pessible to find several subsequences of the 

U’s (and the V’s) consisting of m.i.i.d.r.v.; e.g., if 


k-1 


Xi = Di asti-s, 


j=0 


then Uo, Uris, Usxse, Usess, ++ + is such a subsequence and so is U4, Uxye, 
Uoxs3, Usess, - - -. In such a situation, the cross classification procedure can be 
applied separately to each subsequence of pairs (U:,, V:,) obtaining separate 


tests of significance, although it must be kept in mind when interpreting the re- 
sults that the separate tests may not be statistically independent. In Section 3, 
a method was given that was based essentially on a combination of two such 
tests (i.e., tests based on a,;— A; and on a,—A2). The approach presented there 
could be generalized in order to obtain a method for combining two or more 
such tests in certain situations that are of the kind diszussed in the present sec- 
tion; but this generalization will not be presented here. 

In closing this section, we wish to emphasize that the tests suggested here 
were presented because of their simplicity. These tests are, no doubt, not as 
powerful against certain specific alternate hypotheses as are some other possible 
tests (see, e.g., Hannan [15]). 


5. THE PARTIAL RANK CORRELATION COEFFICIENT 


In Sections 2 and 3, the data appearing in the 2X2 table described the 
joint distribution of the n pairs (U;, V;), where U; and V; are functions of W; 
and Z;, respectively. In Section 4, it was suggested that, in some situations, 
various subsets of these n pairs might be studied separately. In the present sec- 
tion, we shall consider the n(n+1)/2 pairs (U;;, V;:), and various subsets of 
these pairs, where U;;=1 if X;—X,;>0 and U;;=0 if X¥;—X,<0, Vj;=1 if 
Y;—Y,;>0 and V;,=0 if Y;—Y,;<0, and j>1. (The preceding sections dealt 
with subsets of the n(n+1)/2 pairs where j=7+1.) 

A 2X2 table describing the joint distribution of the n(n+1)/2 pairs (Uj, 
V;:) could be formed (as was done for the (U;, V,)) and the usual x? statistic 
computed for this table will be related to the partial rank correlation coefficient 
rxy-r of the X and Y series with “time T held constant” since rxy.r= +/x°/N 
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(see Kendall [18], p. 99-104), where N =n(n+1)/2. The variance of partial r 
is rather complicated and depends on the actual population distributions of X, 
and Y, as functions of time ¢ (see [16]). No simple tests of significance are yet 
known for partial r. The x? statistic can not be used in the usual way to test the 
null hypothesis of independence between the U;; and the V ;; due to the fact that 
the n(n+1)/2 pairs are not independent observations. If a test of significance 
for partial r were available, it would probably be more powerful against certain 
kinds of alternate hypotheses than the simple tests suggested herein, but it 
would probably also require more computation than these simple tests. For 
some further discussion of this general problem, see reference [12]. 

In some situations, various subsets of the n(n+1)/2 pairs (U;;, V;;) might be 
studied; e.g., the n pairs where j=i+1, the (n—1) pairs where j7=7+2, the 
(n+1)/2 pairs where j7=i+(n+1)/2 for i=0, 1, ---, (n—1)/2 (for n odd), 
the (n+1)/3 pairs where j=i+2(n+1)/3 for i=0, 1, - - - , (n—2)/3 (for n+1 
divisible by 3). (The (n+1)/3 values of U;;, where j7=i+2(n+1)/3 for i=0, 
1, -- +, (n—2)/3, were used in reference [5] to test for trend in the X time 
series.) Tests of significance can be derived for these subsets using the same 
general approach as developed earlier herein (see [12]). The particular test to 
apply will depend very much on the structure of the time series being investi- 
gated and on the particular alternate hypotheses under consideration. 


6. TWO NUMERICAL ILLUSTRATIONS 


We first consider the relationship between the amount of investment made 
and the amount of profit received by eight large corporations in the U. §S. in 


TABLE 1. THE SIGNS OF FIRST DIFFERENCES OF INVESTMENT (W) AND 
PROFITS (Z) OF EIGHT LARGE CORPORATIONS IN THE U. 8. 








Years sgn W sgn Z 








1935-36 
1936-37 
1937-38 
1938-39 
1939-40 
1940-41 
1941-42 
1942-43 
1943-44 
1944-45 
1945-46 
1946-47 
1947-48 
1948-49 
1949-50 
1950-51 
1951-52 
1952-53 
1953-54 


b+++14++ 
l+++14+4 


ote 


L+++ititi¢i 


+t i +++ 











Source: [14], p. 164. 
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the period 1935-1954. Investment is measured by the amount of additions to 
plant and equipment plus maintenance and repairs deflated by the implicit 
price deflator of producers’ durable equipment. Profits are measured by net in- 
come to surplus (after taxes) deflated by the implicit price deflator of gross na- 
tional product. The eight corporations included are General Motors, General 
Electric, United States Steel, Atlantic Refining, Union Oil, Diamond Match, 
Goodyear Tire and Rubber, American Steel Foundries. (For a more detailed 
description of the variables studied, see [14].) 
The data of Table 1 can be summarized in the following 2 X2 table: 

















Application of the usual test of independence to this 22 table, as suggested 
in reference [23] and in Section 2 herein, would lead to the computation of the 
statistic 

/ =_ 9 

al Vi9 (9 X 4—4 X 2) = 147. 

V13X6xX11X8 
The distribution of the statistic K is approximately normal with zero mean and 
unit variance, in the case where the U’s are m.i.i.d.r.v., the V’s are m.i.i.d.r.v., 
and there is independence between the U and V. Thus, under the null hypothe- 
sis that there is independence between the U and V, the probability of obtain- 
ing the observed value of K or a larger one is approximately .071 (when the 
time series are of the type discussed in Section 2). Thus, in the situation where 
a one-tailed test is to be performed at the 5 per cent level of significance, we 
would accept the null hypothesis, and would conclude that the observed num- 
ber of comovements can be “explained” by the trends in the two time series. 
Since the null hypothesis was not rejected using the method presented in Sec- 
tion 2, it also would not be rejected if the method proposed in Section 3 were 
applied to the observed data. 

The normal approximation for the statistic K might be improved somewhat 
if the usual “continuity correction” were applied to it; this correction was not 
applied here nor elsewhere in this section, in order not to detract the reader’s 
attention from the main issues under consideration. In practice, application of 
the usual “continuity correction” to the K statistic is to be recommended when- 
ever the time series are of the type discussed in Section 2. 

We now consider the relation between the velocity of circulation of banking 
deposits and the monthly change in the consumer price index in Israel for the 
period April 1956 to March 1958. Table 2 shows the signs of first differences of 
the two time series for each month in the period considered. 
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TABLE 2. THE SIGNS OF FIRST DIFFERENCES OF VELOCITY OF CIR- 
CULATION OF BANKING DEPOSITS (W) AND THE SIGNS 
OF SECOND DIFFERENCES OF THE CONSUMER 
PRICE INDEX (Z) IN ISRAEL 








Months sgn W sgn Z 





April 1956 
May 1956 
June 1956 

July 1956 
August 1956 
September 1956 
October 1956 
November 1956 
December 1956 
January 1957 
February 1957 
March 1957 
April 1957 
May 1957 

June 1957 

July 1957 
August 1957 
September 1957 
October 1957 
November 1957 
December 1957 
January 1958 
February 1958 


March 1958 


i+ td + 3 





+t hits +f 
De de ee ee 


t+ | 











Source: [3], pp. 38-9. 


The data of Table 2 can be summarized in the following 2X2 table: 
V 
U 


Ay 


0 


ag 
| 





5 12 





9 12 











7 
3 | 
10 | 14 | 24 


If we assume that the time series considered are of the type discussed in Sec- 
tion 2, we can apply the usual test of independence to this 2X2 table. In this 
case, we would compute the statistic 


(7X9—5X3)+/24 
/12 X 12 X 10 X 14 


Under the assumptions stated in Section 2, the distribution of the statistic K, 
(when there is independence between the U and V series) is approximately nor- 


r 


1 
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mal with zero mean and unit variance. The probability of obtaining the ob- 
served value of K, or a larger one, under tne null hypothesis that there is inde- 
pendence between U and V, is approximately .049. 

If we assume that the two time series are of the general type discussed in Sec- 
tion 3, then we have to introduce the “serial correlation correction.” In this 
case, we would compute the statistic 


(7*9—5 X 3)/24 


pm : 
* 4/12 X 12 X 10 X 14 + 2 X 24* X (—.1196) (— .0432) 








235.152 
~ 4/20,160 + 3,431 


= 1.53. 








Under the general assumptions stated in Section 3, the distribution of the sta- 
tistic K, (when there is independence between the U and V series) is approxi- 
mately normal with zero mean and unit variance. The probability of obtain- 
ing the observed value of K, or a larger one, under the null hypothesis that 
there is independence between U and V, is approximately .063. 

In this example, we see that in a situation where a test is to be performed at 
the 5 per cent level of significance we would reach different decisions depend- 
ing on the assumptions made about the type of time series under consideration. 
If we assumed that the time series were of the type considered in Section 2, we 
would reject the null hypothesis that there is independence between U and V. 
If we assumed that the time series were of the type considered in Section 3, 
we would accept the null hypothesis, and would conclude that the observed 
number of comovements can be “explained” by taking into account the trends 
in the series and the serial correlation of the U’s and of the V’s. 

In closing this section we note that, in this particular example, the serial cor- 
relation correction increased the estimate of the variance of the “numerator” 
of K, and Kz by about 17 per cent, which is not very different from the 22 per 
cent referred to earlier (see Section 3) in the special case where the time series 
are m.i.i.d.r.v. 


7. THE RELATION BETWEEN THREE OR MORE TIME SERIES 


For the sake of simplicity, we shall limit the discussion here to the study of 
the relations between three time series; this discussion can be generalized to the 
study of more than three time series as well. We shall assume that these three 
time series, X, X@, Y, are of the kind described in Section 2, where X‘ 

= {Xo x?,-- xo} and X®={X?, X?,.--, X@}. As in Section 
2, we define U Wa when X¥,-—X?>0 ond usa0 when X{},-—X? 
<0 for j=1, 2. We also define V;=1 when Yi4,;— Y;>0 and V;=0 when Yi41 
—Y,<0. A 2X2 X2 cross classification can be formed describing the observed 
joint distribution of the n triplets (U$”, U?, V,), and the usual procedures 
for testing various hypotheses in 222 cross classification tables can be ap- 
plied to this observed table. For example, the null hypothesis that the U!, 
U®, and V; are mutually independent can be tested (see, e.g., [22, p. 280); 
this will test the hypothesis that the observed number of positive and negative 
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comovements in the three time series and the observed number of contramove- 
ments of various kinds (6 different kinds) can all be “explained” in terms of the 
number of positive movements in the X“ series, the number of positive move- 
ments in the X series, and the number of positive movements in the Y series; 
i.e., that the observed data can be explained by the “trends” in X™, X™, and 
Y. It is also possible to test the null hypothesis that V; and (U!’, U?) are 
independent (see, e.g., [22, p. 281]) ; this will test the hypothesis that the move- 
ments in series Y are independent of the joint movements in the X®, X® 
series (i.e., that the positive and negative movements in Y are independent of 
the positive and negative comovements and the two kinds of contramove- 
ments in X“ and X®), This test studies whether the observed data can be ex- 
plained by the trend in the Y series and the joint movements in the X, 
X series. In a certain sense, it tests whether the multiple association (correla- 
tion) between V and (U™, U) is zero. 

In some situations, a test of whether the multiple association between, say, V 
and (U“, U®) is zero may be of less interest than the actual measurement of 
the strength of the observed association between V and (U“, U®). For some 
discussion of the problem of measuring this multiple association, the reader is 
referred to [13, pp. 761-2]. For some discussion of the problem of measuring the 
partial association between, say, V and U™, when U™) is “held constant,” see 
[13, pp. 760-1]. It is also possible to test whether the partial association be- 
tween V and U“™ is zero when U™ is “held constant.” This test studies whether 
the number of comovements in series Y and X“ at times when there have been 
positive movements in the X series can be explained by the number of posi- 
tive and negative movements in series Y and series X“ at these times, and also 
whether the number of comovements in series Y and X“ at times when there 
have been negative movements in the X“ series can be explained by the 
number of positive and negative movements in series Y and series X“ at those 
times. 

In reference [23], there is some discussion of the study of the relationships 
between three time series, and hypotheses different from those mentioned here 
are presented. For some further discussion of the analysis of multi-way cross 
classifications, the reader is also referred to reference [28] and to the literature 
cited there. 


APPENDIX 


Let us assume that the time series considered are of the general type defined 
in Section 3 herein. We shall now prove that the covariance C{ Uj, Ui} be- 
tween U,_, and U; is negative; i.e., that 


Pr{ Uis _ U; = 1} = Pr{Uis = 1} Pr{ U; = 1} a = p? < 0. 


It will follow from this that the covariance C{V i+, Vi} between V; and V; is 
negative; i.e., that h—r?<0. Furthermore, the covariance C { a, a} between a 
and a», defined in Section 3, will be negative, since 


C{ai, a2} = [gh — p*r*](n — 1). 


Thus, the “correction for serial correlation,” 2(n—1)[gh—(pr)?], appearing in 
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the variance formula for a, is negative; while the “correction for serial correla- 
tion,” 2n(g—p*)(h—r*), appearing in the variance formula for a—A, is positive. 
It is only necessary to show that 


Pr{ Uva = Us = 1} = Pr{ Xe < Xi < Xe} < Pr{ Xin < Xi} Pr {| Xi< Xia. 
For the sake of simplicity, we shall assume here that these probabilities are 
neither zero nor one, and that the random variable X,(i=0, 1,---) has a 


probability density function g,;(z). Denoting the cumulative distribution func- 
tion of X; by G;(x), we see that 


Pr{ Xe < X,} -f Gi-1(z)gi(x)dx 


> f " Gealagda)de + Ges(0)[1 — G.(0)] 


e ° jl — G,(c) | 
> f G,-x(z)gi(z)dz + f G.-1(2)9:(2)d2 ——— 
aie ~~ G;(c) 

i f G.-1(z)9.(x)dx/G:(c), 
where c is chosen so that 0<G;_1(c) <1 and 0<G;,(c) <1. Thus, 
Pr{ Xin < Xx} Gi(c) >f Gy-1(z)gi(x)dz, 


so that 


+o +0 vy 
Pr{Xi1< Xi f Gilu)gena(ydy > f f Gi-s(2)9:(a)degiaa(y)dy; 


from which it follows that 
Pr{ Xi < X;} Pr\ X; i< Xin:} > Pr{ Xi Ci Ese Xiai}. Q.E.D. 
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SOME ASPECTS OF SEASONALITY IN THE 
CONSUMER PRICE INDEX 


H. E. Riiey 
International Labour Office* 


The Consumer Price Index of the Bureau of Labor Statistics is sub- 
ject to a variety of seasonal influences, some of which are related to 
supply and some to demand factors. Recent calculations of seasonal 
indexes for the CPI and its major components reveal that there is little 
seasonality in the index as a whole, although some items show seasonal 
price movements of considerable magnitude. 

Although B.L.8. has not thus far published a seasonally corrected 
CPI, it has given attention to the practical problems of seasonality. 
Widespread use of the CPI for wage escalation emphasizes the need for 
consideration of the problem. 


HE Consumer Price Index of the United States Bureau of Labor Statistics 
Tis issued without adjustment for seasonality. Although frequent reference 
is made in reports and releases to seasonal aspects of price movements, there 
has been no official publication of seasonal adjustment factors. Nevertheless, 
considerable attention has been devoted to problems of seasonality in the CPI 
both by B.L.S. and by users of the index. This article examines, largely in 
non-technical terms, several practical aspects of the subject, and presents some 
recently developed seasonal factors for the CPI. 

Seasonality can be approached from two directions. One involves the de- 
tection and analysis of the periodic movements in the index series through the 
application of standard statistical methods. The other refers to the problems of 
a seasonal character which are encountered in the course of gathering the basic 
data, and which may or may not be “corrected” at that stage, depending upon 
the nature of the problem and the purpose of the index. This group, in fact, 
comprises two types of problems: those which must be tackled one way or 
another before the index is calculated, and those phenomena which may be 
dealt with at the stage of price collection if the intention is to produce an index 
free of seasonal influences. The latter two types include the problems of com- 
modities that are not in the market at certain seasons, those that are subject 
to periodic price movements related to model changes, and seasonal variations 
in consumption patterns. 

Consideration of these various seasonal questions in relation to the CPI 
must also take into account the scope and objectives of the index. The CPI is 
defined as “a measure of changes in prices of the goods and services bought by 
families of city wage earners and clerical workers.”! The quantity weights used 
at present in the index formula represent the annual amounts of each com- 
modity or service used by the “index” families in 1951-2, the base-weight years. 
From the thousands of specific goods and services entering into the typical 
wage earner family’s pattern of living, about 300 items were selected for the 





* Views expressed are the author's own and do not necessarily reflect the views of the International Labour 
Office. 
Techniques of Preparing Major B.L.S. Statistical Series.” Bul. No. 1168, U. S. Department of Labor. p. 63. 
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actual pricing sample. By imputation, these items carry the entire weight of 
the average base year family expenditure. 

For most of the commodities and services in the sample list, prices are col- 
lected from representative retail stores or service establishments in 46 cities, 
geographically distributed throughout the country and ranging in size from 
the largest urbanized areas to small towns of under 3,000 population. Food 
prices are collected monthly in each city. Other commodities, as a rule, are 
priced quarterly in a rotating subsample of cities, selected so as to provide a 
current set of prices each month for each city size group and geographical area. 
All items are priced monthly in the five largest urbanized areas (New York- 
Northeastern New Jersey; Chicago; Los Angeles; Detroit; Philadelphia). 
Prices of some components, such as house purchase and financing, fuels, utilities 
railroad fares, and insurance, are obtained by mail or from secondary sources. 

One of the important features to bear in mind is that the index is designed to 
reflect the trend in prices paid by a particular segment of the population— 
urban middle income families of wage earners and clerical workers. These 
families as a rule receive relatively fixed incomes, derived from weekly pay- 
checks. Their incomes are quite evenly distributed throughout the year, and 
they tend to spend income as it is received. Some seasonal and random varia- 
tions occur, of course, as families borrow or spend from savings for vacations, 
Christmas presents, emergencies, etc. By and large, however, the average index 
family probably spreads the bulk of its expenditures fairly evenly throughout 
the year. To the extent that prices in the aggregate are affected by the current 
volume of spending, it may be surmised therefore that the CPI would exhibit 
very little seasonality. This is not to deny, of course, that there is considerable 
seasonality in both prices and consumption of some of the major commodities 
in the index. As prices decline seasonally for one segment, they may rise in 
another. 

Seasonality in consumer prices is caused by a variety of influences, some on 
the demand side and some on the supply side. In both cases climatic factors are 
the most important, although by no means the sole causes of periodic fluctua- 
tions in prices. On the demand side, people have different needs and desires in 
warm weather than in cold. Seasonal consumption patterns arising from this 
factor should exhibit about the same contour year after year, although the 
price effects will be modified by supply factors, including possible substitutions. 
The magnitude of this climatically influenced seasonal demand, with respect 
to the national index, may change over time as a result of major population 
shifts. Thus the effect of seasonal movements in domestic heating fuels on the 
index may diminish as population shifts toward the South and West. Custom 
also plays a large part in seasonal variations on the demand side. Christmas, 
Easter, vacation periods and other habitual practices contribute to the rise and 
fall in consumption. 

On the supply side, the greatest seasonal price changes result from variations 
in agricultural production, especially among the perishable foods. Consumer 
demand for these items appears to be fairly stable throughout the year, with 
the result that limited supplies in the “off” seasons command premium prices. 
This situation offers an inducement to the suppliers to find ways of marketing 
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these products during the high price periods. In addition, the producers seek 
to improve efficiency and reduce overhead costs by stabilizing production 
throughout the year. Great progress has been made in developing cold resistant 
crops, improved transportation, storage and preservation of foodstuffs as a 
means of stabilizing marketable supplies. 

The seasonal movement of automobile prices has been the subject of con- 
siderable controversy. This is the outstanding example of a seasonal price 
change generated by the industry’s practice of introducing annual model 
changes. B.L.S. has been pricing the three low-priced volume sellers—Ford, 
Chevrolet, and Plymouth. Each month the B.L.S. field agents obtain direct 
from dealers the actual transaction prices for standard models of each car. The 
net price as reflected in the index is, in effect, the list delivery price, less dis- 
counts and less over-allowances on trade-ins. These monthly prices exhibit a 
very pronounced seasonal pattern, generated by the annual model change. 
When the new models are introduced in the fali, prices rise sharply. Under 
market conditions of the past year or so, some discounting of the new models 
has occurred almost immediately after their introduction. As the year goes on, 
the discounts increase in size and the average net transaction price declines. 
By September or October the price of the “old” model is at its lowest, and 
herein is the source of the argument. The current model, sold in September at 
a substantial discount, may be brand new stock, just off the assembly line and 
equivalent in every mechanical respect to a similar car produced at the be- 
ginning of the model year. Because of the imminence of a new model, however, 
the buyer places a lower value on the current model. It may be argued that 
this represents a quality change for which an adjustment should be made in 
computing the index. The effect of such an adjustment would be, of course, to 
eliminate the periodic price movement. B.L.S., however, looks upon this annual 
price variation as a seasonal phenomenon and, in accordance with its present 
policy, does not attempt to eliminate it from the index. 

It has been suggested that the seasonal peak of automobile prices in the CPI 
may be overstated by the practice of dropping the old models from the index in 
the month in which the new models are introduced. There is no doubt that this 
has occurred, although in years of normally good sales most dealers “unload” 
the old models before introducing the new. B.L.S. has recently attempted to 
solve the problem by continuing to price the old models for a period after intro- 
duction of the new, weighting the two prices by relative volume of sales. It was 
found, however, that even when a considerable number of old models remained 
near the end of the model years, their inclusion had little effect on the move- 
ments of the index as a whole. Another factor tending to spread out the effect 
of the model change is that the new models are not all introduced in the 
same month. 

The introduction of annual model changes seems to be spreading to other 
consumer durables, such as television sets and kitchen and laundry equipment. 
Under present B.L.S. practice, however, the effects may not be quite as sharp 
as in the case of automobiles, as the new model is generally priced only after it 
becomes the volume seller in the store. Thus, old models may be priced in some 
outlets after new models have been substituted in the list for others. 
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Seasonal movements in the supply and prices of meats, fruits and vegetables 
are easily recognized, although the turning points and the magnitude of seasonal 
fluctuations may vary widely from year to year. Reference has been made to the 
problem of pricing items, such as fresh fruits and vegetables, which may 
virtually disappear from the market during certain periods each year. The same 
problem exists also in pricing some seasonal articles of clothing. Included among 
the fruits in the CPI are grapefruit, peaches, strawberries, grapes and water- 
melons. Strawberries, for example, are generally available in quantity during 
the months of April, May and June only. B.L.S. is unable to obtain representa- 
tive prices during the other months of the year. But every item in the index 
is represented by a base quantity weight which is uniform for each month in 
the year. Lacking a price for an item in an “off” month, a choice must be made 
between two alternatives: to use a nominal price, such as the last price ob- 
tained, or to shift the weight to another item for which a current price is avail- 
able. For the CPI the second has been adopted. The weight of an unpriced 
fruit is, in effect, shifted to the other fresh fruits. This implies the assumption 
that nominal prices of the items not in the market follow the average move- 
ments of the year-round fruits. Usually the index for fresh fruits exhibits a 
sudden rise in the month when the price is reintroduced. In 1954, for example, 
strawberries came on the market at an average price of 38.7 cents. At the close 
of the season in June 1953 the price had averaged 27.2 cents. The year-round 
fresh fruits, on which the index was based during the “off” months, exlibited 
much less seasonality. Apples, for example, were 19.4 cents in June 1953, as 
against an average for the year 15.7 cents, and 15.9 cents in April 1954, com- 
pared to the average in that year of 15.3 cents. 

Seasonal changes in the supply of perishable fruits and vegetables are ac- 
companied, of course, by variations in the volume of family consumption of 
those items. In this situation, the use of fixed and uniform weights, identical 
for all months of the year, obviously does not produce a monthly index which 
measures the changing cost of the family’s actual consumption. This is con- 
sidered by many index users to be a serious weakness, and various methods 
have been suggested for the application of seasonally varying weights.? 

A seasonal weighting method now employed in the Canadian consumer price 
index is based on the observation that total monthly expenditures on the group 
of foods composed of fruits and vegetables, fats, eggs and meats were relatively 
constant, although purchases of individual items within the group varied from 
month to month. Accordingly, the monthly total weight for the group is held 
constant, but the weights of the individual components vary.’ 

Unfortunately, the application of seasonally varying weights produces an 
index which does not conform with the traditional criteria for a measure of 
“pure” price change. The pure price index should possess two essential char- 
acteristics: 1) between two points in time the index must be a function of 





? Bruce D. Mudgett, “The Measuremn, of Seasonal Movements in Price and Quantity Indexes,” Journal of the 
American Statistical Association, 50 (1955, ©%-8; and Doris P. Rothwell, “Use of Varying Seasonal Weights in Price 
Index Construction,” Jbid. 53 (1958) 66-77. 

It is interesting that Wesley C. Mitchell in his classic “The Making and Using of Index Numbers” (B.L.S., 
Bulletin No. 173, 1959) makes no reference to seasonality. Most of the series to which he referred showed only an- 
nual figures. 

* The Consumer Price Index, Jan. 1949~Aug. 1952. Dominion Bureau of Statistics, Ottawa, Canada. 
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changes in prices of identical commodities; and 2) the weights used in com- 
puting the index may not change between any two periods being compared. 
The use of changing weights may produce a change in the index between two 
periods when there have been no changes in the prices of the individual com- 
ponents. In effect, the seasonally weighted index takes on the attributes of an 
index of expenditures, rather than of prices alone. 


1. COMPUTATION OF SEASONAL FACTORS FOR THE CPI 


Although B.L.S. has thus far resisted proposals that it undertake the regular 
publication of a seasonally adjusted CPI, considerable attention has been de- 
voted to the problem, and several sets of seasonal factors developed. The most 
recent and comprehensive analysis of seasonality in the CPI was carried out 
by the method developed at the United States Bureau of the Census.‘ This 
procedure involves the computation of a moving seasonal, using a variation of 
the usual ratio-to-moving average method. Stable seasonal factors have been 
computed by averaging the moving seasonal for the 12-year period, 1947-58, 
for which the data are available. 

The stable seasonal factors, for the CPI as a whole and for selected com- 
ponents, are shown in Table 1. It will be seen that the range of seasonality in 
the CPI is not wide, from 99.6 in February to 100.3 in September, a spread of 
0.7 points. 

As expected, the fruits and vegetables exhibit wide seasonal variations in 
prices. For the fruit and vegetable group as a whole (not shown) the seasonal 
factors ranged from 93.0 in October to 108.3 in June, a difference of 15.3 points. 
Oranges, potatoes, and tomatoes with ranges of 24.2, 25.5 and 61.3 points, 
respectively, represent nearly one-fourth of the total weight in this group.® 
Their seasonal movements tend to be somewhat offsetting, however. Thus, the 
peak factor for oranges coincides with the lowest factor for potatoes. 

The effects of some of the seasonal phenomena described above can be seen 
in Table 1. House furnishings, for example, are at their lowest point in August. 
This probably represents chiefly the effect of the August “white sales” of sheets, 
towels and other household fabrics. Most of the new car models are introduced 
in October or November, with the result that their seasonal price peak is 
reached in November. On the other hand, the higher discounts on old models 
immediately before the new model year brings new car prices down to their 
lowest point in September. 

Table 2 shows the range between the high and low seasonals of the selected 
items for each year from 1947 through 1958. The data indicate that seasonality 
in the index as a whole has been steadily decreasing since 1947. Among the 
selected components, beef and veal, pork, and fats and oils exhibit the same 
tendency to a pronounced degree. Seasonality in poultry prices has, however, 





4 See Julius Shiskin and Harry Eisenpress, “Seasonal Adjustment by Electronic Computer Methods,” Journal 
of the American Statistical Association, 52 (1957) 415-49, and “The B.L.S. Seasonal Factor Method,” a paper pre- 
sented by Abe Rothman of B.L.S. at the Annual Meeting of the American Statistical Association, Palo Alto, Cali- 
fornia, 1960. 

5 The relative importance figures, shown in the last column of the table, indicate the weights attached to the 
various items at the time of the last index revision in 1952, The relative importance changes over time as a result of 
changes in price relationship among the items. For the complete table of relative importance, see Techniques of 
Preparing Major B.L.S. Statistical Series, Ibid. pp. 73-5. 
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TABLE 2. RANGE OF FLUCTUATION IN SEASONAL FACTORS, 1947-1958 
All Items and Selected Items 
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increased significantly over the period. These divergent movements can be ex- 
plained by the trends of production and prices of these commodities during the 
post-war period. Poultry production has been increasing rapidly since the end 
of the war and prices have been generally, declining since 1952. Cyclical factors 
in livestock production, together with a period of drought, have affected the 
beef and pork price trends. Other commodities, including oranges, potatoes, 
tomatoes and eggs exhibit a wide range of seasonal price movement in every 
year, with no discernible trend toward an increasing or diminishing degree of 
seasonality. 


2, IMPLICATIONS OF SEASONALITY IN THE CPI 


Whether or not seasonal influences should be eliminated from the CPI or a 
seasonally corrected, rather than the present index, should be published, de- 
pends upon the magnitude of the seasonal variation and the uses made of the 
index. Official policy with respect to the CPI has been governed in large part 
by considerations related to its use for wage escalation. It has been estimated 
that upwards of 4,000,000 workers are employed under collective agreements 
providing for periodical adjustments of their wages in accordance with changes 
in the CPI. As some of these contracts run for several years, there has been 
considerable pressure on the B.L.S. to maintain the conceptual framework of 
the index in order to avoid the necessity of reopening the agreements. Opposi- 
tion has been expressed to the proposal that a seasonally adjusted index be 
published, in addition to the present series, on the grounds that the existence 
of two official figures will lead to confusion and misunderstanding as to their 
meaning and application. 

From another point of view, it may be argued that wage escalation should 
be based on a seasonally adjusted, rather than an unadjusted index. It is 
pointed out that wage increases resulting from cost of living escalation tend 
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to result rather quickly in price increases. If the wage increase occurred only 
because of seasonal price movement it may be followed later in the year by a 
corresponding wage decrease. But in a generally inflationary situation the lower 
wage cost is not likely to lead to a price reduction. This further feeds the 
inflation. 

In view of the limited range of seasonal movement shown to exist in the 
CPI, it is unlikely that use of a seasonally adjusted series in wage escalation 
would have an appreciable dampening effect on any inflationary tendencies 
that may be generated by the escalation device. If, for example, the contract 
provides for wage adjustments based on the indexes for January, April, July 
and October, the workers would gain 0.2 per cent between January and April 
by the seasonal adjustment and would lose 0.3 per cent between April and July. 
As a test, we might look at what would have happened in the period 1951-53 
under a wage contract similar to the agreement between General Motors Cor- 
poration and the United Automobile Workers, if the wage adjustment had been 
based on a seasonally corrected CPI. For this purpose we will use the moving 
seasonals for those years. Under the contract wages are to be increased one cent 
per hour for each increase of six-tenth’s of an index point. The adjustments 
are made quarterly, on the basis of the CPI at January, April, July and 
October. The contract provides that, with the index at 110.8 or less, no adjust- 
ment is due; between 110.9 and 111.5 wages increase one cent, etc. The hourly 
wage increments, based on the unadjusted and the adjusted CPI work out 
as shown in Table 3. 

Both indexes give the same wage adjustments from April 1951 through 
January 1953. The workers would gain a cent under the adjusted but not under 
the unadjusted index in April 1953, but would receive an increase of two cents 
in July 1953 if the unadjusted index were used. By October, however, the wage 
adjustment would again be identical under both indexes. The effect on the total 
wage bill through December 1953 would be the same under either method. 

Rather than urge the adoption of a seasonally adjusted CPI for purposes of 
wage escalation, it might be wiser to recommend that cost-of-living wage ad- 
justments be made annually, instead of quarterly or semi-annually. This would 
both avoid the seasonal problem and minimize such erratic payroll changes as 
would occur in the above illustration, which, on the basis of the adjusted index, 
has hourly wage rates dropping one cent in January 1953, but the loss restored 
in April. Annual rather than more frequent adjustments are to the dis- 
advantage of the workers, however, under conditions of rising prices. In view 
of the fact that price trends have been generally upward over the past quarter 
century, and there is widespread belief that a gradual increase in the price 
level is inevitable, it would appear unlikely that wage escalation will be volun- 
tarily abandoned by the trade unions, or that they would favor less frequent 
adjustments. Except possibly in periods of unusual price fluctuations, however, 





* The G.M,-U.A.W. agreement in effect in 1951-53 referred to the unrevised CPI on the 1935-39 base. In 1953 
a new agreement adopted the current CPI on the 1947-49 base. For purposes of illustration the revised index has 
been used. 








SEASONALITY IN CONSUMER PRICE INDEX 


TABLE 3. APPLICATION OF CONSUMER PRICE 
INDEX IN WAGE ESCALATION 
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it appears that the inflationary effects of cost of living wage escalation would 
be about the same under a seasonally corrected index as under the unadjusted 
series. 





ALMOST LINEARLY-OPTIMUM COMBINATION 
OF UNBIASED ESTIMATES 


Max HAa.Perin 
Knolls Atomic Power Laboratory 


A procedure for combining averages, each estimating the same param- 
eter, is proposed. If the averages arise from a multivariate normal 
sample, the procedure yields the maximum likelihood estimate of the 
parameter. It is shown that providing the number of vector observations 
is greater than the number of averages being combined, exact confi- 
dence intervals can be obtained based either on Student’s “t” distribu- 
tion or a related distribution (unfortunately not tabulated). If the 
averages arise from unequal sized samples on independent normal dis- 
tributions with unequal variances, similar results hold subsequent to 
random matching and data transformation providing the minimum 
sample size is greater than the number of averages being combined. In 
both cases, the proposed estimate is almost linearly optimum in the 
sense that it has variance equal to that of the minimum variance linear 
unbiased estimate of the parameter except for a multiplicative factor 
which approaches unity as all sample sizes become large. 


1, INTRODUCTION AND SUMMARY 


STATISTICAL problem of frequent occurrence is the combination of averages, 
say 4, %2,---+, 2, each of which presumedly estimates the same pa- 
rameter, say uw. These averages may be correlated, with (%, - - - , #) being the 
vector of means of a multivariate sample (1;, --~- , 2), j=1,2,---+,nor 
they may be independent with #; being the average of n; sample values. In the 
first instance we would want to combine #,, ---, 2 taking into account the 
possibly unequal variances and the correlated nature of the observations. In 
the second case one must consider the possibly unequal variances and the pos- 
sibly unequal sample sizes. In both cases, especially under the normal distribu- 
tion assumption, an appropriate definition of a “best” combination of the esti- 
mates is the minimum variance linear unbiased estimate (MVLUE) derivable 
from the data. These “best” estimates, of course, are linear combinations of 
4%, - - +, & involving coefficients which, in general, must be estimated from the 
sample data. 

For the first problem outlined above, assuming normality and that the co- 
variance matrix is unknown, it is found that the maximum likelihood estimate 
of u, 2 say, is simply the MVLUE with the unknown variances and covariances 
replaced by the usual sample estimates. By an appropriate conditioning it is 
found that one can “studentize” a to allow exact conditional confidence inter- 
vals on » based on a Student’s “t” statistic with n-k degrees of freedom. A possi- 
ble alternative, which is not investigated in detail, arises from the fact that the 
“studentization” can be accomplished in two ways, only one of which leads 
to Student’s “t” distribution. The other possibility leads to an unconditional 
distribution also yielding exact confidence intervals on u; this distribution is 
closely related to the “t” distribution with n-k degrees of freedom, having the 
same symmetrical form but with more probability mass at the extremes of the 
range. No definite conclusion is drawn as to which system of intervals is better 


36 





COMBINATION OF UNBIASED ESTIMATES 37 


since it was not feasible to perform the extensive numerical integration to ob- 
tain the percentage points of the unconditional distribution, which would be re- 
quired for appropriate comparison. Some numerical investigation was done for 
n=25, k=2 and a 95% confidence interval, a case of particular interest in the 
application involved. In this case, it was found that the conditional distribu- 
tion yielded confidence intervals which were shorter on the average. 

The second problem outlined above assuming normality, unknown and pos- 
sibly unequal variances, and possibly unequal sample sizes, can be reduced to 
the first problem by using randomization and a transformation given by Scheffé 
[3]. There is no loss of precision in the reduced sample averages but min, n; 
is the size of the multivariate sample so obtained. Thus all conclusions drawn 
from the first problem hold for the second problem as reduced. 

In both problems £ is an almost linearly optimum estimate in the sense that 
the variance of f is simply the variance of the MVLUE multiplied by a factor 
approaching unity from above as n or min n; increase. 

Some theoretical comparisons between f and alternative estimators are given 
both in terms of variances and average confidence interval length. A numerical 
comparison of various estimates and their confidence interval length in a Monte 
Carlo problem is included. 


2. CASE I: CORRELATED ESTIMATES 


In this case we suppose we have a sample of ——— vectors (24;, ° 
sj) j=1,2,--+,nonN(ye, >.) where e=(I, - - - , 1) and >> is the kXk Pay 
known) wactheada matrix. 

First we consider the estimation of « by maximum likelihood (that the esti- 
mate of this paper is in fact maximum likelihood was pointed out by a referee). 
One can write L, the logarithm of the likelihood as 


# 1 
L = —~tog.| "| - — Des-WG—--=LDoSa 2.1) 
2 2 ih 2 th 


where o** is an element of 


by and Si, = > (wig — Fs) (Zany — Zn). 


j=l 
One thus has as estimating equation for yz, 
ned, (# — ne’) = 0, (2.28) 
where ¥’=(%, -- - , %)’. As estimating equations for o and o* one has 


n (cofactor of o**) 


|| 


Si — n(Z; — w)* = 0, t= 1, 2, eg »k (2.2b) 





and 
2n (cofactor of o**) 
| "| 


From (2.2b) and (2.2c) it is readily shown that, if n>k, 


— 2S, — 2n(%— u)(Zr.— vw) =O th, h>i. (2.2c) 
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f+ ae [s- nS~'(x’ — pie’)(% — fe)S~ | 


1+ n(# — fe)S-"(x' — pe’) 
where S is the kXk matrix with elements Sj. Substituting (2.3) in (2.2a) one 
has as the (unbiased) estimate of p, 





(2.3) 


eS~'y’ 


eS—'e’ 


(2.4) 





A= 


If n<k, we are in the degenerate case where S does not have an inverse so that 
(2.3) and (2.4) no longer follow from equations (2.2a), (2.2b), (2.2c). We will 
consider only the case n>k. 

Since @ is maximum likelihood, it is, of course, asymptotically normal and 
efficient. Computation of the asymptotic variance of a from the inverse informa- 
tion matrix gives 

V (2.5) 
ar 2 > —=——_ 5 
ne>,‘e’ 
The right hand side of (2.5) is simply the variance of the MVLUE; we shall 
usually denote it by o*.p+ in further discussion. 

The results above, although useful, do not permit exact confidence state- 
ments on u for small samples. We would thus like to “studentize” 2, if this is 
possible. It would perhaps be “natural” to use (2.5) with }>~' replaced by 
) an to “studentize” f. Instead we proceed as follows: 

If >> were known, the maximum likelihood estimate would be given by min- 
imizing 

1 
Q=n— ue) >) (%’ — ue’) (2.6) 


with respect to ». Evidently this would give (2.4) with S replaced by >> and 
would be the MVLUE. Now suppose we apply the following transformation 
to x’, 


z’ = Ax’ (2.7) 


| Orpjn1 
ec ( ), (2.8) 
5 | 


and O;,, and e_, are vectors of k—1 components with all elements zero and 
unity respectively, while I: is the (k—1) X(k—1) identity matrix. We notice 
that A*=TI so that A is its own inverse. Thus (2.6) becomes 


Q = n(e: — pw, t2,°--, 2) D> (21 — ws, ts, - +, a)! (2.9) 
where >); '=A’>_"'A. 


Now we partition }-, writing it as 


r= ( 


where 


(2.10) 


Len Lan 
doen =) 
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where > .n is the element in the upper left hand corner of >>, >>. is a row 
vector of (k—1) elements, )>si= >o’s2, and > 2 is a (k—1)X(k—1) matrix. 
Now define 


Q. = (22,- > +, 2) Dosss (en) «+ +5 2a)! (2.11) 


and observe that exp —4Q, aside from constants, is the density of the multivari- 
ate normal distribution of 2:, 22, - - - , 2: while exp —4}Q,, aside from constants, 
is simply the density of z2, - - - , 2. It thus follows [1] that 


nz, ~ Do > ae 22 (22, (pl tap , 2x)’ |? 





Q-Q= (2.12) 
> 2 Ke >> p Boe | > st 
so that the MVLUE can be written as 
Pmvive = 21 — doa ¥s 22 (22, °° +, Ze)’. (2.13) 


Now we observe that we could obtain @ by simply replacing > by S in (2.6) 
and minimizing with respect to u. Since the argument subsequent to (2.6) does 
not really depend on >> but only on its positive definite symmetric character, 
it at once fellows that an expression for f alternative to (2.4) is given by 


ee ea eo Ra (2.14) 


where S, is defined analogous to >*, and partitioned as in (2.10). 
Thus ~ has the structure of the estimated intercept in a multiple linear re- 
gression. This fact immediately suggests how we should proceed. Thus we apply 


the transformation (2.7) to each set (xi;, +--+, we), j=1, 2, +++, n to get 
(z:;, ** *, 2j). We then consider the conditional distribution of (zu, + + + , Zin) 
given zn, «°°, 2k1, 222, * °°, 2k ** * y Zany °° * » Sen, aNd easily verify that the 


21; are independent and 


N[u + do a12 yee aed Zi)’; doen = 8 > 012 pel > «21)- 


If now we estimate » and > .2 > ~~ under the specified condition in the usual 
regression fashion we see that S,12S,3 is indeed the estimate of the vector of 
regression coefficients and (2.14) is the estimate of the intercept. Finally, we 
note that 


= e’ = eA’, Ae’ = [Soa -— Doan : Sal 


this can be verified by partitioning Pas inverting and performing the indicated 
multiplication. 

From the above discussion it follows that, under the specified condition, 2 
is normally distributed with mean yu and variance 


noon | — + as.'a'| 


where d= (zs, - + +, 2s) =(%:—4#2, + - - , 21:—#) and Sg=Sy22. Note that dSz'd’ 

















40 AMERICAN STATISTICAL ASSOCIATION JOURNAL, MARCH 1961 


= (Tt sn—-ee1)/n(m—1), where T?_:n-24: is a central Hotelling’s T? statistic 
with the indicated degrees of freedom. Hence, since A is conditionally unbiased, 
the unconditional variance of f is just the expected value of the conditional 
variance, i.e. 03,.[1+(k—1)/(n—k—1)]. This last ‘result makes precise the 
speed of the convergence of the variance of 2 to the asymptotic value o3,+; a 
little algebra shows that for the variance of f to differ from o2,, by 100p% or 
less we must have n>[k(1+p)/p]—(1—p)/p. We cannot make so strong a 
statement about the conditional variance of p. We can say that (T?-1..-241)/ 
(n—1) converges in probability to zero as n—+~ and therefore the ratio of con- 
ditional variance to o2,, converges to unity. Since 7}_;,,-24: is a random vari- 
able we cannot ask for a sample size such that the conditional variance of f 
differs from o% by no more than 100p%. We can ask for the sample size such 
that with probability at least (1—«) the conditional variance differs from op 
by no more than 100p%. It is easy to show that such an 7 is given by the 
smallest integral n satisfying 
k-1 
n= 5 Py-1n—n+i(€-) + (k — 1) 


where Fy_;,,-x4:(€) is the value of Snedecor’s F, on the indicated degrees of 
freedom, exceeded with probability e. 

It also follows from the above discussion that an estimate of nodp_ with n—k 
degrees of freedom is available from the sum of squares of deviations from re- 
gression and that this sum of squares is distributed conditionally and uncondi- 
tionally as no2,, times a chi square with n—k degrees of freedom. It follows 
that we can obtain small sample exact confidence intervals from the condi-, 
tional Student’s “t” distribution with n —k degrees of freedom of the variable # 


ae (a — w) (2.15) 


bons (1 —_ Zee 
n—-1 





where 6}, is the square root of the estimate of o2,, from the sum of squares of 
deviations from regression. Since the distribution of t,_, depends on the condi- 
tioning only via T?_1,,—241, it is possible, in principle, to get unconditional con- 
fidence intervals on » based on the distribution of 


(2.16) 


: 
Te-1,n—041 \'/? 
—_—_—__—- = 


n—1 


Ta = (1 + 


It is clear that the distribution of r,. is very much like that of Student’s 
“t, »” in being symmetrical about zero but that the degrees of freedom are ef- 
fectively reduced since even moments are clearly always greater than those of 
t,x. These two possible bases for confidence interval estimation raise the in-' 
teresting question as tv which distribution gives the “better” confidence inter- 
vals, say in the sense of average shortness. To answer this question would re- 
quire extensive numerical integration to obtain per cent points of 7,-. and un- 
fortunately such computational effort was not feasible. It might be remarked, 
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however, that for k=2, n=25 and a 95% confidence level, a case of particular 
interest for the application stimulating this work, the requisite computations 
were performed and showed confidence intervals based on (2.15) to be superior. 

Another point worth considering is the improvement one can expect by using 
f rather than other possible estimates of u. This improvement can be measured 
in at least two ways. One way involves comparison of the unconditional vari- 
ance of @ with that of alternative estimates. This, however, would only be ap- 
propriate asymptotically since a is non-normal in small samples. Another way 
of measuring the improvement would be by comparing average confidence in- 
terval length using either (2.15) or (2.16); this would appear more meaningful 
for small samples. However, we make both comparisons, basing the confidence 
interval length comparison upon ¢t,-, because of the unknown distribution of 
7,-k. As alternative estimates of » we consider arbitrary weighted combinations 
of any r<k of &, - - - , %, with weights adding to unity. To facilitate compari- 
son we note that the matrix A defined by (2.8) is not unique and that we only 
require that the transformation A be non-singular and such that Zza=y, Ez; 
=0,1=2,---, k. Thus we can let the alternative estimate be z,. It at once 
follows that 





















Variance of p n—2 

——__—_—_—_—_ = (1 — Rk?) ——_—_ _(2.17) 

Variance of z, n—k-—1 > 
where RF? is the multiple correlation between z; and ze, - - - , 2. Thus a will have 





smaller variance than z; providing 










k-1 


n—2 


k?> 





(2.18) 









Since R?<1, (2.18) implies no gain is possible if n<k-+1, in which case is not 
defined, and that for moderate n it is quite possible to lose precision by using 
all the data. 
In comparing confidence intervals at say the (1—2a) level, we observe that 
for z;, we would have (n—1) degrees of freedom for error so that expected half 
confidence interval length would be given by 
n 
r(*) | 
Das oy 2 
VVar 21 2 ————— tn-1 (a) 
n-1 
C+) 
2 
where ¢,-:1(a) is the (positive) value of Student’s “t” with (n—1) degrees of 


freedom, exceeded with probability a. Basing confidence intervals for » on 
(2.15) we find that the expected half confidence interval is 


ie 


VVar p V/2 ————— tn» (a). 
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Thus one has 


Length of Confidence Interval based on p 





Length of Confidence Interval based on 2; 


pe / _n—2 t.+(a) (2.19) 
Ko n—k—1 t-s(a) 





where R? is defined as before. Hence confidence intervals based on f and (2.15) 
will be shorter than those based on 2; if 


pan h 8 Kited 


n—-2 = t-4(a) 


R?>1— 





(2.20) 


The conclusions from (2.20) parallel those from (2.18) with the value of R? 
required for clearcut superiority of confidence intervals based on A being rela- 
tively larger than that required for superiority in the variance comparison. 


3. CASE II: INDEPENDENT ESTIMATES 


We now suppose that we have k independent samples (1, 22, - - - , 2in,), 
i=1,2,---,k, where ra, - + -, Zin, is a random sample on N(uy, o7) and a; is 
unknown. We also suppose that n,=min n; and that the second subscript of 
the x;; reflect a random matching of the k samples. Now, following Scheffé [3] 
we define 


ny, ] nt 
253 =iyj- 


1 x 
/ — 23 + ————= Vira -— Dra 
Ni V iN; hat Ni hat 


4=1,2,---,m;t=2,3,---,k (3.1) 
It is readily verified that 
Ez; = 0; Cov 2ijZpa = O, JH#h; 
ni 
Var 2;; = o; + — o:; Cov 21,24; = o1, ¢=2,---,k; 
Ni 
Cov L1jZik = 0, 7 a h; Cov 2ij2pj = 4 


We have thus reduced Case II to Case I. One can show for this reduced sample 
that 


2 as Be 
Copt is 1 > — 
iml Oj 


which is the variance of the MVLUE in the unreduced sample. All the results 
of Case I then apply, except that f@ is a maximum likelihood estimate only in the 
reduced sample. The details are omitted. 


4, SOME NUMERICAL COMPARISONS 


In Table A we compare some sample results for the almost optimum esti- 
mator of section 2, for the case of two correlated estimates, with the simple 
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TABLE A. COMPARISON OF ESTIMATES AND HALF LENGTH OF 
95% CONFIDENCE INTERVALS FOR COMBINING 
TWO CORRELATED ESTIMATES 


(Sample Size = 25; 2 =optimum estimator; 7 =simple average of the two estimates; 
u* =estimate with smaller variance a priori) 





x 





$ confidence interval lengths for 





B u* a 





.3309 .3343 .3318 .0108 .0130 -0105 
-2039 -2101 .2071 .0097 .0096 -0065 
-2798 -2831 -2811 .0085 .0113 -0087 
- 2867 . 2869 -2857 .0110 .0157 -0113 
-2910 .2788 -2877 .0230 .0272 -0218 
- 7387 .7407 _ , .7283 -0026 .0022 .0054 
. 7609 . 7638 7525 -0033 .0023 -0036 
. 1404 . 1388 .1419 -0084 .0078 -0078 























average of individual estimates and the better of the two individual estimates’ 
in terms of the estimates obtained and their confidence interval half lengths. 
The data we use arise from the Monte Carlo solutions of a number of similar 
neutron transport problems, in which it was possible to derive from the same 
data a pair of estimates for the parameter of interest. The Monte Carlo pro- 
cedure involved twenty-five repetitions of a basic experiment in each case; for 


each repetition a pair of estimates was obtained which, at least approximately, 
could be presumed to have a bivariate normal distribution. The data shown are 
for separate experiments. 

It will be noticed in Table A that the gain in precision of f is never spectacu- 
lar and that in some cases one even loses. This is simply a reflection of the facts, 
implied by earlier discussion, that whether or not we will gain in apparent pre- 
cision by combining estimates is itself a random variable and also that the ex- 
tent of the gain (or loss) is a random variable. One would of course like an 
answer to the question implied in the last remark, i.e. when should estimates be 
combined? It is clear that the answer to this question depends on the true vari- 
ances, covariances and the sample sizes involved and simply does not permit a 
definitive answer without invoking further assumptions. 
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BIAS IN ESTIMATES OF THE U.S. NONWHITE POPULATION 
AS INDICATED BY TRENDS IN DEATH RATES* 
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Scripps Foundation for Research in Population Problems, 
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In the first half of each intercensal decade in the thirty-year period 
1925 to 1955, the age-adjusted death rates for nonwhite persons have 
declined relatively more rapidly than for white persons. In the second 
half of each intercensal decade just the opposite situation prevails, an 
excess of white over nonwhite decline. There are no exceptions to this 
curious pattern for either males or females in this period. Partial ex- 
planation of this phenomenon is suggested in terms of increasing under- 
estimates of the nonwhite relative to the white population in the two 
intercensal periods 1930-40 and 1940-50. 


nN A study of declining trends in white and nonwhite death rates a most 
I interesting and as yet largely inexplicable pattern emerges. In the first five- 
year period of each intercensal decade from 1925 through 1955 (1930-35, 
1940-45, 1950-55) there are greater percentage declines in nonwhite age- 
adjusted rates,' both male and female, compared with rates for white males and 
females. However, the second five-year period of each intercensal decade 
(1925-30, 1935-40, 1945-50) witnesses just the opposite situation, greater 
percentage declines for white males and females compared with nonwhite males 
and females. This alternating pattern is present in spite of greater overall non- 
white mortality decline in this thirty-year period. White male rates declined 
30.2 per cent and nonwhite male rates 42.6 per cent in this period; for females 
the comparable percentages are 49.2 and 53.5. 

Unless we are to assume conditions in the first half of each intercensal decade 
to be particularly propitious for nonwhite mortality decline and those in the 
second half of each such decade particularly propitious for white mortality de- 
cline, we must presume the presence of some kind of systematic error. The 
source of this error might be in the numerator (number of deaths) or the 
denominator (population generalized about) used in calculating death rates. Or 
the source of error may be in both. As I will indicate presently, a plausible ex- 
planation of this phenomenon appears to be in the intercensal estimates of the 
nonwhite population made by the Bureau of the Census. These estimates are 
used by the National Office of Vital Statistics for the computation of death rates 
and other vital statistics such as birth rates.? 

Data which indicate the magnitude of this cyclical pattern in the declining 
mortality experience of the four sex-color populations are given in Tables 1, 2, 
and 3. Table 1 shows age-adjusted rates for each population for the period 1925 





* This paper was read in part at the 1959 National Meetings of the American Sociological Society, Edgewater 
Beach Hotel, Chicago, September 3-5. 

' Only age-adjusted and age-specific death rates are referred to in this paper. 

? For years prior to 1940 intercensal estimates by age, sex, and color used by the National Office of Vital Sta- 
tistics are obtained from Vital Statistics Rates in the United States, 1900-1940, U. 8. Government Printing Office: 
Washington, D. C., 1943; and for intercensal estimates for years after 1940 the source is “Estimates of the Popula- 
tion of the United States and of the Components of Change, by Age, Color, and Sex: 1940 to 1950,” Current Popula- 
tion Reports, Series P-25, No. 98, 1954. 
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TABLE 1. AGE-ADJUSTED DEATH RATES WITH ANNUAL PER CENT 
CHANGES IN RATES, BY SEX AND COLOR, 1925-57* 
(Per 100,000 population) 








Male Female 





Annual per Annual per 
cent change cent change 








W NW Ww NW 





° 


1925 .6 | 2,143. 2,036. 
1926 .1 | 2,207. 3. 
1927 8 | 2,037.6 | —6. 
1928 5 | 2,167. 6. 
1929 8 | 2,193.0] —1. 
1930 .9 | 2,099.9} —5. 


2,077. 
1,926. 
2,024. 
2,001. 
1,920. 


© N00 im im 
I OND 


1931 : 1,989. —2. 
1932 .3 | 1,862. —1. 
1933 ; 1,809. —1. 
1934 . 1,903. 2. 
1935 : 1,846. —1. 


1,809. 
1,701. 
1,640. 
1,670. 
1,605. 


omoow 
wv one 
© © & O & 


1936 . 2,003. 4. . 
1937 ‘ 1,921. —3. ‘ 968. 
1938 , 1,766. —7. ‘ 907. 
1939 ‘ 1,706. —1. ‘ 893. 
1940 ‘ 1,764. 1. : 879. 


1,702. 
1,628. 
1,551. 
1,487. 
1,504. 


CORP Om 
QOnwan 
toe Oe 


1941 .41 1,691.3 | —3. ’ 829. 
1942 .2 | 1,580. —2. ; 798. 
1943 .3 | 1,563. 3. ‘ 820. 
1944 41 1,492.3} —3. ; 779. 
1945 .4 | 1,446. —1, d 752. 


1,433. 
1,327. 
1,337. 
1,265. 
1,193. 


wind en 
Aowae 
NhkoOeRS 


1946 8 | 1,349.6 | —4. : 730. 
1947 .5 | 1,356.4] —1. ' 706. 
1948 .3 | 1,378.0 | —1. 683. 
1949 .7 | 1,346.6 | —2. 659. 
1950 1 | 1,358.5 | -0. 645. 


1,135. 
1,140. 
1,121. 
1,105. 
1,095. 


SCHoae 
o> to wo 


1951 .3 | 1,341. —0. : 633. 
1952 ‘ 1,343. —1. ‘ 622. 
1953 . 1,523. —0.: : 611. 
1954 ‘ 1,232. —4. ‘ 579. 
1955 . 1,230. Ba , 579. 


1,070. 
1,048. 
1,021. 
951. 
945. 


NO wo 























1956 -8 | 1,243.7 0. 
1957 4 


5 . 576.1 951.2 ; 0.6 
1,297.3 2.0 3 584.7 981.1 3.1 








* Computed by the direct method using as the standard population the age distribution of the total 1940 
enumerated population of the United States. 

Sources: Rates for 1925 through 1953 are from U. S. National Office of Vital Statistics, Vital Statisticse—Special 
Reports, 43 (1956), 5-18. Rates for 1954 were obtained from unpublished NOVS data. Rates for 1955 from NOVS, 
Vital Statistics—Special Reports, 46 (1957), 111-159. Rates for 1956 from NOVS, Vital Statistics—Special Reports, 
48 (1958), 193-245. Rates for 1957 from NOVS, Vital Statistice—Special Reports, 50 (1959), cxix. Rates for 1930, 
1940, and 1950 are computed from census enumerations taken on April 1 in each of these years. For all other years 
rates are based on population estimates as of July 1. Rates for 1949 and earlier non-census years are based on inter- 
censal estimates. Rates for 1951 and subsequent years are computed from postcensal estimates. 
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TABLE 2. PERCENTAGE DECLINES OF AGE-ADJUSTED 
DEATH RATES, BY SEX AND COLOR, 1925-55 








Male Female 





Decline for Average annual Decline for Average annual 


Years total period decline total period decline 





W NW W NW Ww W 





1925-30 2. 0 .48 0.26 7. 


1930-35 3. mi 0.74 48 Ws 
1935-40 6. 5 .16 74 10. 


1940-45 te 0 .50 3.88 14. 
1945-50 10. . .08 -20 14. 


1950-55 4.{ 9. .00 -94 10. 
1930-40 9. 16.0 0.95 -61 16. 


f 1 
1940-50 16. 3.0 1.79 2.54 
1925-55 30. 42.6 1.16 1.75 49. 





























Source: Percentages calculated from data in Table 1. 


through 1957 together with the per cent of change over the previous year. Table 
2 gives the percentage declines for the four sex-color categories for the six 
five-year periods from 1925 through 1955 together with five-year average 
annual rates of decrease for each. Table 3 shows the absolute changes in points 
of decline by five-year periods. 

From Tables 1, 2, and 3 the following observations are noteworthy: 

(1) For all four sex-color categories there has been a substantial decrease in 
death rates. This decline has been relatively greater for nonwhites compared 
with whites and for females compared with males. The average annual rate of 
decline for each sex-color category was greater during the decade from 1940 


TABLE 3. ABSOLUTE DECLINES IN DEATH RATES, BY SEX 
AND COLOR, 1925-1955 


(Per 100,000 population) 








Male Female 
Years 





Ww 





1925-30 37. 


1930-35 47. 
1935-40 73. 
1940-45 84. 
1945-50 107. 


1950-55 46.8 

















Source: Calculated from rates given in Table 1. 
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through 1950 than for the preceding period back to 1925 and for the succeeding 
period through 1955. 

(2) After 1930 both nonwhite male and female death rates begin dropping 
at a faster rate than those for white males and females. (The mortality data 
accumulated for the Industrial Policyholders of the Metropolitan Life In- 
surance Company indicate precisely the same year for the beginning of a trend 
of greater percentage declines for both sexes of nonwhite over white policy- 
holders [5].) 

(3) A sizable excess of nonwhite over white percentage declines is apparent 
for both males and females in the first half of each intercensal decade. Just the 
opposite situation prevails in the second half of each intercensal decade. There 
is no exception to this pattern in either sex over this thirty-year period. 

(4) The pattern of death rate decline fluctuates less for the white population 
than for the nonwhite population in both sexes. In the two full intercensal 
decades 1930-40 and 1940-50 the declines in the first half of each intercensal 
decade are about three times as great as in the second half for both nonwhite 
males and females. No such fluctuations exist for white males and females.’ 

(5) The years 1956 and 1957, which saw a general upswing in mortality, 
show a continuation of the pattern with changes for nonwhites more unfavor- 
able than those for whites in both sexes. 

The writer has calculated the percentage declines for the age-specific rates 
for the four sex-color categories over this thirty-year period to determine if 
this alternating pattern is present throughout the age categories from “under 1” 
through “85 and over.” In general, but with several exceptions, this same curi- 
ous pattern characterizes the age-specific rates after 1930. This is shown in 
Table 4. Each age category for which there is a greater white percentage decline 
is marked by an “X.” 

This pattern of alternating greater white and nonwhite percentage declines 
is not present prior to 1925 for age-adjusted rates and prior to 1930 for age- 
specific rates. This does not in any sense imply greater validity for the death 
rates (and the factors which go into their computation) in this earlier period 
because it is only after 1930 that there is any tendency for nonwhite death rates 
to drop at a faster rate of decline than the white rates. It might also be pointed 
out that prior to 1933, when the Death Registration Area became coterminous 
with the continental United States, the factor of internal migration complicated 
the process of estimating the denominator—particularly for nonwhites. As late 
as 1920 only 66 per cent of the enumerated nonwhite population was included 
in the DRA, but by 1930 this figure had increased to 93 per cent [3]. 


EXPLANATION 


Annual death rates are calculated by the National Office of Vital Statistics 
according to the following formula: 





3 Professor A. J. Jaffe of Columbia University has raised the question of why white mortality rates, particu- 
larly among males, tend to decline faster in the second half of the two intercensal decades in the 1930-1950 period? 
Unlike Professor Jaffe, the writer does not regard this as a spurious pattern related to the cyclical decline of non- 
white death rates b the ind dent data of the Metropolitan Life Insurance Company show mortality de- 
cline patterns similar to those for U. 8. white males and females in this period (2, 5, and 6]. For nonwhite males and 
females these Metropolitan data indicate a rather constant rate of decline, unlike the analogous United States series. 
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TABLE 4. EXCESS OF WHITE OVER NONWHITE PER CENT 
DECLINE IN AGE-SPECIFIC DEATH RATES BY 
FIVE-YEAR PERIODS, BY SEX, 1925-55 


(*X” represents a greater white per cent decline) 





1925-30 1930-35 1935-40 1940-45 1945-50 1950-55 








Age 


M F M F F M F F M F 


< 





x x x 
x 


Under 1 
1- 4 
5-14 

15-24 

25-34 

35-44 

45-54 

55-64 

65-74 

75-84 

85 and over 


MA? RK MEXXKXEX 
xX XXXKXXKXXKXX 
KKK KKKKKK ] S 









































Sources: Age-specific rates for 1925 through 1950 are from U. 8. National Office of Vital Statistics, Vital Statis- 
ticse—Special Reports, 43 (1956), 5-18. Rates for 1955 are from Table BO, NOVS, Vital Statistics of the United States, 
1966, Vol. 1, Washington, D. C., G.P.O., 1957, p. XCIII. 


where 


m,;= death rate for any defined ith class—usually for calendar year 
D;=registered deaths in the ith class—usually for calendar year 
P,=population exposed to the risk of death in the ith class (taken as mid- 
year estimated P,) 
k=a constant, usually 1,000 or 100,000. 


In’ assessing the validity of death rates both the numerator (D,) and the de- 
nominator (P,) need to be investigated. Problems of the numerator involve 
completeness of death registration and the accuracy of sex, race, and age state- 
ments on death certificates. With regard to the denominator there are questions 
regarding the completeness and accuracy of the census enumeration and of the 
accuracy of the intercensal or postcensal estimate. 

If, in addition to refusing to accept the alternating pattern of greater percent- 
age declines as significant, we also exclude the possibility of alternating patterns 
in the white-nonwhite proportion of deaths registered or alternating patterns 
in the misstatement of race on death certificates, then only one source of ex- 
planation seems to remain: the enumeration and estimates of the sex-color 
populations. One explanation of this pattern in the downward trend of white 
and nonwhite death rates might be systematically increasing underestimates 
of the nonwhite population relative to the white population in the intercensal 
years. That this is the case is suggested by the differentials in Table 5 which 
indicate the annual and average annual per cent differentials in white-nonwhite 
mortality change by sex for the two complete intercensal decades in the period 
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TABLE 5. WHITE-NONWHITE ANNUAL PER CENT CHANGE 
DIFFERENTIALS (NW-W), BY SEX, 1930-1940 AND 1940-50 
Male Female Male Female 
Year Year 




















1930-31 ’ 1940-41 
1931-32 | —4. ; 1941-42 
1932-33 | —1. 6), 1942-43 


1933-34 2. 0.6] 1943-44 | —1.: 
1934-35 | —1. : 1944-45 | —2. 
1935-36 4. ; 1945-46 | —2.: 
1936-37 | —1. , 1946-47 1 


; 1947-48 3. 
6) 0.6 1948-49 0. 
| 1 





1938-39 —2.4 
1939-40 2.4 





1937-38 —0.9 
—0.3 


8 1949-50 














Source: These differentials calculated from annual per cent changes given in Table 1. 


of our concern, 1930-40 and 1940-50. These differentials are obtained by sub- 
tracting white from nonwhite annual per cent change in age-adjusted death 
rates. A positive differential implies greater white decline (or lesser increase) in 
death rates and a negative differential implies greater nonwhite decline (or 
lesser increase). Note the systematic change from negative to positive dif- 
ferentials throughout the 1940-50 periods for females. Note further that the 
1930—40 decade is not really an exception to the above tendency for males be- 
cause the later years of the decade are characterized by average negative dif- 
ferentials much smaller than in the early years of the decade. 

The writer has no very facile explanation for this phenomenon, but a hy- 
pothesis may be suggested. Is it possible that there has been a relative improve- 
ment in the enumeration of whites compared with nonwhites in the three 
censuses of 1930, 1940, and 1950? The writer proposes a positive answer to 
this question as the most “available” partial explanation for the alternating 
pattern in the two intercensal periods 1930-40 and 1940-50. This is because 
the final official death rates at ages 10 and over for non-census years are calcu- 
lated from intercensal estimates of population components. These estimates 
are based on statistics and estimates of births, deaths, immigration, emigration, 
and number of Armed Forces personnel overseas, but are adjusted to conform 
with census enumerations [7]. 

There is some slight evidence to indicate that there was better enumeration 
of whites relative to nonwhites in 1950 compared with 1940. Coale has made 
some estimates of per cent error of under- and overenumeration in the 1940 
and 1950 censuses by age, sex, and color [1]. He gives three different sets of 
estimates according to three different assumptions based on the hypothesis of 
similar error patterns in the 1930, 1940, and 1950 censuses [1, pp. 26-41]. He 
supports this hypothesis (and, hence, his technique for calculating per cent 
errors in enumeration) on the grounds of similarity in these three censuses in 
census-taking methods, in age and sex ratios, and in the discrepancies from one 
census to the next. 
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Table 6 gives Coale’s estimated per cent underenumerations for the four sex- 
color populations for the 1940 and 1950 censuses according to three different 
assumptions about similar patterns of error. In each case relative underenu- 
meration of whites is smaller than that for nonwhites in 1950 compared with 
1940. This is true for both males and females. None of these sets of assumptions 
gives per cents of underenumeration, however, that would account for any 
great portion of the alternating pattern of greater death rate declines between 
1940 and 1950, but they are all in the direction that supports the hypothesis of 
greater relative enumeration of the white over the nonwhite population in 1950 
relative to 1940.* 


TABLE 6. ESTIMATED UNDERENUMERATION IN THE 1940 AND 1950 
CENSUSES AS A PER CENT OF THE CENSUS POPULATION, 
BY SEX AND COLOR, ACCORDING TO THREE DIFFERENT 
ASSUMPTIONS ABCUT SIMILAR PATTERNS OF ERROR 








Male Female 





Assumptions White Nonwhite White Nonwhite 





1940 | 1950] 1940| 1950| 1940} 1950/ 1940] 1950 





Assuming errors in 1930 equal to the 
average of 1940 and 1950 errors 3.5 | 2.6] 19 19 3 1.8 


Assuming errors in 1930 equal to those 
in 1940 or 1950, whichever is less 2.8 : 17 


Assuming errors in 1940 equal to those 
in 1950 8 -2 | 22 22 | 1.0 6} 10 10 





























Source: Adapted from Ansley J. Coale, “The Population of the United States in 1950 Classified by Age, Sex, 
and Color—A Revision of Census Figures,” Journal of the American Statistical Association, 50 (1955), 29. 


It is important to note that the above hypothesis does not explain the pres- 
ence of this pattern of alternating white and nonwhite excess mortality decline 
at the younger ages. This pattern is found for both males and females in both 
the 1930-40 and 1940-50 iritercensal decades at ages under 1 and 1-4. In inter- 
censal years final official death rates at ages under 1 are computed from regis- 
tration data and rates for ages 1-4 are based largely on registration data. The 
presence of this pattern in the younger years where death rates are computed 
largely from registration data in the denominator may provide a clue to an 
alternative or supplementary explanation of this phenomenon. 

There is an additional factor which may parallel in its effects this tendency 
of an increase in the relative completeness of enumeration of whites over non- 
whites in the last three censuses. It is very likely that the period 1930 through 
1950 saw an increase in the tendency for individuals to be enumerated as 
“white,” but registered as “Negro” on death certificates. The reason for this 
supposition is the great northward and urban movement that has characterized 
southern Negroes since the late 1930’s. Certainly this tendency, regardless of 





* Coale’s calculations here do not take into consideration differences among the last three censuses in five-vear 
age-heaping |1, 28-30). This might affect these estimates somewhat. 
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its relative frequency, would be more common in the North than in the South, 
in an urban environment than a rural one. 


IMPLICATIONS 


The hypothesis of systematic bias in estimates of the population of the four 
sex-color populations which I am suggesting as a partial explanation for the 
alternating trends in white-nonwhite mortality decline has significance for all 
rates calculated from population estimates such as birth rates. While the popu- 
lation estimates are probably reasonably accurate for the total population and 
for the white population, the estimates of the nonwhite population are seriously 
inaccurate.’ Consequently, any rates computed from these estimates will be 
seriously inaccurate. Because of this the utmost conservatism had best be ap- 
plied before significance is attached to any short-term trend in any time series 
of nonwhite rates. This is particularly true in any comparison of white-non- 
white trends because the extent of the differential will tend to be understressed 
in early intercensal years and overstressed in later intercensal years. In fact, 
in some instances the actual short-term trends in white-nonwhite differentials 
may be just the opposite of the trend apparent from the official rates. For exam- 
ple, official death rates show a marked excess of white over nonwhite mortality 
decline for both males and females in the period 1945-50. However, according 
to the independent mortality data of the Industrial Policyholders of the 
Metropolitan Life Insurance Company, which are not computed from popula- 
tion estimates, there is greater percentage mortality decline for both nonwhite 


males and females compared with white males and females for these years [2]. 
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5 Coale estimates the 1950 census underenumeration for white males at 2.7 per cent of the enumerated white 
male population, for white females 2.3 per cent. For the nonwhite population, however, the comparable estimates 
of underenumeration are 15 per cent of the enumerated nonwhite males and 11 per cent of the nonwhite females. 
The total underenumeration of the United States population is estimated by Coale at about 3 per cent of the enumer- 
ated population [1, p. 44]. 








MULTIPLE COMPARISONS AMONG MEANS 


Outve JEAN DUNN 
University of California, Los Angeles 


Methods for constructing simultaneous confidence intervals for all 
possible linear contrasts among several means of normally distributed 
variables have been given by Scheffé and Tukey. In this paper the 
possibility is considered of picking in advance a number (say m) of 
linear contrasts among k means, and then estimating these m linear 
contrasts by confidence intervals based on a Student t statistic, in such 
a way that the overall confidence level for the m intervals is greater 
than or equal to a preassigned value. It is found that for some values 
of k, and for m not too large, intervals obtained in this way are shorter 
than those using the F distribution or the Studentized range. When this 
is so, the experimenter may be willing to select the linear combinations 
in advance which he wishes to estimate in order to have m shorter 
intervals instead of an infinite number of longer intervals. 


1, INTRODUCTION 


HERE has been considerable work done on the problem of finding simul- 
taneous confidence intervals for a number of linear contrasts among sev- 
eral means for normally distributed variables. Scheffé [1] gives a method for 
constructing simultaneous confidence intervals for all possible linear contrasts 
among k means using the F distribution. Tukey’s intervals for all possible 


linear contrasts among k means use the distribution of the Studentized range 
[2]. Each of these methods may be extended to give confidence intervals for 
all possible linear combinations of the k means, as opposed to linear contrasts 
only. 

In this paper the possibility is considered of picking in advance a number 
(say m) of linear combinations among the k means, and then estimating these 
m linear combinations by confidence intervals based on a Student ¢ statistic, 
so that the overall confidence level for the m intervals is greater than or equal 
to a preassigned value, 1—a. It is possible that for some values of k, and for 
m not too large, intervals obtained in this way may be shorter in some sense 
than those using the F distribution or the Studentized range. If this is so, the 
experimenter may be willing to select the linear combinations in advance which 
he wishes to estimate in order to have m shorter intervals instead of an infinite 
number of longer intervals. 

The purpose of this paper, then, is to suggest and evaluate a simple use of 
the Student ¢ statistic for simultaneous confidence intervals for linear combi- 
nations among several means, and to see under what conditions these intervals 
apply. The study was actually made with linear contrasts in mind, since these 
are probably estimated more frequently than other linear combinations among 
means. The paper has been written, however, in terms of arbitrary linear com- 
binations, in order to stress the fact that the method is not limited to con- 
trasts. 

The method given here is so simple and so general that I am sure it must 
have been used before this. I do not find it, however, so can only conclude that 
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perhaps its very simplicity has kept statisticians from realizing that it is a 
very good method in some situations. In any case, the users of statistics in the 
main seem unaware of it, so I feel that it is worth presenting. 


2. THE METHOD 

















Let the k means be yw, - - - , wx, and let the estimates for them be fy, ---, 
fx, which are normally distributed with means mi, -+--, we and with vari- 
ances a;,o°, i=1, - - - , k; let the covariance between fi; and f; be a,;o* for i). 
Here the a;; and a,; are assumed to be known, but o*? may be unknown. Let 
6? be an estimate of o? which is statistically independent of fi, ---, fx and 
such that vé?/o* follows a Chi-square distribution with v degrees of freedom. 
(These conditions are exactly those used by Scheffé in obtaining his intervals 
[1]. The condition that the dispersion matrix of the 2; be known except for a 
factor o* is necessary in order to construct ¢ statistics which are free of nuisance 
parameters. ) 

Let the m linear combinations of the means which are to be estimated be: 


(1) 










s=1,2,---+,m. 





0, = Cisfi + + + * + Crste, 


A linear combination is, in particular, a linear contrast if 


k 
Dd Cie = (). 





The unbiased estimates for @,, - - - , Om are 


6, = cisfli + - + - + Chef, s=1,2,-:-,m. (2) 


















These are m normally distributed variables, and the variance of 6, is 6,20, 
where 


B . 


b, = bn >» AgsCisCjs. 
t=1 j=l 
This reduces the problem to one I discussed earlier [3], of finding confidence 
intervals for the means of m normally distributed variables. 
The variates ¢;, - - - , tm, each of which follows a ¢ distribution with » degrees 
of freedom, are formed: 





s=1,--+,m. (3) 








The variates ¢, t2, -- + tm have some joint distribution function; using a 
Bonferroni inequality, one can obtain a lower limit to the probability that all 
the ¢,’s lie between —c and +c (where c is any positive constant) without know- 
ing anything about this joint distribution except that all the marginals are 
Student ¢ distributions. 

Thus 


Pi[-c<t<c i=1,2,--+ mjp>1- 2am f F°(dt, (4) 
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where f (¢) is the frequency function for a Student ¢ variable with v degrees of 
freedom. 

If c is selected so that the right hand member of (4) equals 1—a, then con- 
fidence intervals with level 1—a are obtained from 


se 


6, — 9, 
p| -e < ; <ec,s=l,--- > ; (5) 


They are 
6, + cb.é, g=1,2,:--,m. 


Here the overall confidence level for the m linear combinations is 1 — a, where 
c is defined by 


f “FOdt =, and FH 


2m 


is the frequency function of a Student ¢ variable with v degrees of freedom. Some 
or all of these linear combinations may of course be linear contrasts. 

When fi, - - - , & are the sample means jj, --- , J, and when j1,---, He 
are statistically independent, then a,,=1/n,;, where n, is the size of the sample 
for the j,’s; for i*j, a;;=0. The confidence intervals for ¢y,yi+ - + - +Cksue 
become 


(Cis + 2 + Cksx) + c 


Table 1 gives values of c for 1—a=.95 and for various values of v and of m; 
Table 2 gives c for 1—a=.99. These tables as well as the other tables appearing 
in this paper have been computed from Biometrika Tables for Statisticians, Pear- 
son and Hartley [4]. 


3. COMPARISON WITH INTERVALS USING F DISTRi8UTION 
Scheffé’s intervals for any number of linear contrasts among k means are 
6, + Sb.é (8) 


where S?=(k—1)F.(k—1, v). Here F.(k—1, v) is the 1—a@ point of the F dis- 
tribution with k—1 and » degrees of freedom, and the other symbols are de- 
fined as in Section 2. 

When intervals for a number of linear combinations are desired (not restrict- 
ing them to linear contrasts), the intervals are as given in (8), but with & 
=kF,(k, v). 

Since the ¢ intervals in (6) and F intervals in (8) are seen to be of exactly the 
same form, and require exactly the same assumptions, it is both easy and useful 
to compare them. 

The main difference between them is that in the ¢-intervals, the set of linear 
combinations which are to be estimated must be planned in advance, whereas 
with Scheffé’s intervals they may be selected after looking at the data, since 
Scheffé’s method gives intervals for all possible linear combinations of k means. 
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large set of linear combinations which includes all those which might conceiv- 
ably be of interest. Then, on looking at the data, one may decide on actually 








possible in using the ¢-intervals to select as the intervals to be estimated a very 








This is, of course, a considerable advantage for Scheffé’s method. It is, however, 
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computing intervals for only some of this set. Section 5 gives an example of 
this procedure. 

A second difference between the methods is that the leagths of the t-intervals 
depend on m, the number of linear combinations, whereas with Scheffé’s 
method the lengths depend on k, the number of means. It seems reasonable to 
suspect, then, that the ¢-intervals may be shorter for small m and large k, and 


TABLE 3. VALUES OF c*/S? FOR 1—a=.95, .99 








1—a=.99 





s | 1 | 15 
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that the F intervals may be shorter for large m and small k. This turns out to be 
true. 

Perhaps the most appealing way of comparing the two methods, from the 
standpoint of the research worker, is on the basis of length. To do this, Table 3 
gives values of c?/S? for certain values of k and m, for »=7, 20 and , and for 
1—a=.95 and .99. Here S? is defined as (k—1)F.(k—1, v), so that the table is 
applicable as it stands when linear contrasts are being estimated. The square 
root of c*/S? is the ratio of the length of a t-interval compared to the length of 
the corresponding Scheffé interval. Thus for l1—a=.95, v= 0, m=50 and 
k=10, one has c*?/S?=.64. This means that if one wishes to estimate 50 linear 
contrasts among 10 means, that each of the 50 t-intervals is .8 times as long as 
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the corresponding interval using the F distribution. A second interpretation is 
that about 64 per cent as many observations are necessary using the ¢-intervals 
to obtain the same precision as using the F distribution. 

If one is estimating linear combinations among means rather than simply 
linear contrasts, one enters the table with a k value increased by one. Thus for 
50 linear combinations among 9 means, with 1—a=.95 and »= ~, one has 
c?/S? = .64. 


TABLE 4. VALUES OF ms, THE MAXIMUM NUMBER OF LINEAR 
CONTRASTS OF k MEANS FOR WHICH F INTERVALS ARE 
LONGER THAN ¢-INTERVALS, 1—a=.95, .99* 








je os | ti ewee 





20 





0 0 
2 ‘ ‘ ‘ 3 
5 10 
10 23 
6 3¢ 55 46 

6 129 3: 104 

7 281 ¢ 19X10 59 X10 
3 614 38 X10 14X10? 
1 

5 


1 
) 
2 
3 
5 
7 


126 X10 316 X10 
9 27 X 10? 696 X10 
123 56 X 10? 149 X 10? 
158 108 X10? 310 X 10? 
190 223 X 10? 694 X10? 
26 X10 426 X 10? 150 X 10° 
30 X10 872 X 10? 312X108 
38 X10 182 X 10° 66 X 10* 
45 X10 329 X 108 12X105 
54X10 635 X 108 26 X 105 
132 K 104 6X 10° 











* The last significant digit given in ms cannot be expected to be exactly correct except where mg is smaller than 
100. For example, mg = 26 X10 indicates that at some point in the calculation a number with only two significant 
digits was used; in computing ng =36, on the other hand, three significant digits were carried throughout. 


It appears from Table 3 that for a fairly large number of means, the ¢-inter- 
vals are shorter for any m of reasonable size. 

In Table 4 are listed values of ms, the maximum number of linear combina- 
tions for which the Scheffé intervals are longer than the ¢ intervals. If, for a 
given k, v, and 1—a, one decides to estimate m linear contrasts among k means, 
one may examine ms from Table 4. If m<msg, the Student ¢ intervals are 
shorter; if m>ms the Scheffé intervals are shorter. Table 4 gives ms for v=7, 
20, ©, l—a=.95 and .99, and k=2, 3, - - - , 20. The table indicates that for k 
as large as 10, one may form a large number of contrasts using the /-intervals 
and still have intervals smaller than if the F distribution had been used. 

If one wishes to estimate linear contrasts among 9 means with »=20, l—a 
= .95, then entering Table 4 at k=9, one finds ms= 189, so that if the number 
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of contrasts to be estimated is less than or equal to 189, the Student ¢ intervals 
should be used rather than Scheffé’s intervals. If linear combination rather 
than just contrasts are being estimated, then one enters the table with 10 rather 
than 9 and finds ms = 300. 

Examination of Table 4 indicates that the situation becomes more favorable 
for the Student ¢ method if, as all other variables except one are held constant: 

i) k is increased; or 

2) » is increased; or 

3) 1—a is increased. 


4. COMPARISON WITH TUKEY’S INTERVALS 


If fi, -- -, @ are unbiased, normally distributed estimators of ui, --- , we 
with Var (@,;)=a,,0? and Cov (f;, 2;)=0, then confidence intervals of level 
1—a for all possible linear contrasts 


k 
6 = Cyr + +++ + Come, > « = 0, 
t=] 
are given by 


l 


2 


6+ 


k 
. | | Va1196, (9) 
tel 


where q is the 1—a point of the Studentized range for a sample of size k, and 
é* is an independent estimate of o? such that vé?/o* has a x? distribvtion with v 
degrees of freedom. 

For 2;=9;, based on a sample of size n, these become 


6+ > | c:| q@/Vn. (10) 


t= 1 


It is apparent that, as formulated here, these intervals are more limited in ap- 
plication than the ¢ intervals and Scheffé’s intervals, since (1) they apply only 
to linear contrasts rather than to arbitrary linear combinations; (2) the vari- 
ances are assumed to be equal; (3) the covariances are assumed to be all zero. 
Tukey [2], states without proof that they may. be extended somewhat in all 
three directions. 

Limitation (1) may be removed by introducing a (k+1)st mean whose esti- 
mate, fxsi, is always zero. Tukey shows that there is no appreciable error in 
simply using the same intervals for linear combinations as for linear con- 
trasts, provided k>2. To be ultra conservative, one may use the same intervals 
but enter the tables of the Studentized range with k+1 rather than with k. 

In using Tukey’s intervals for combinations as opposed to contrasts, it should 
be noted that formulas (9) and (10) must be altered by replacing 4).|c¢;| by 
the larger of the sum of the positive {c;} and the negative of the sum of the 
negative {c,}. 

In an effort to remove limitation (2), Tukey considers the case where Var 
(A;) =ayo?, t=1,---, k, and Cov (Qf, 2;)=0, 47, with the a; known con- 
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stants. In other words, the covariances among the 4; are zero and the ratios 
among their variances are known. 

By multiplying the gé@ in (9) by a factor depending on the particular con- 
trast and on the various a;,, instead of by \/ay, he obtained intervals with an 
overall confidence level which he says is approximately 1—a, adding that work 
is being done on studying this approximation. 

Limitation (3) may also be removed when only contrasts are being consid- 
ered. If Var (@;)=ai10°, i=1,---, k, and Cov (Aj, @;) =ai20*, ij, where ay, 
and diz are known, then the confidence interval for 


b 
6= > C ihe; 


i=1 

becomes 
1 k 
” + =e» | cs | (ay; ~ Qy2)'!*qe. (11) 
“= i=l 

Thus the contrasts using the Studentized range have been partially extended, 
but not to the more general situation where the variances and covariances of the 
A; are a;,o? and a,c", with the a;; and a,;; known. The extension to unequal vari- 
ances seems to be somewhat arbitrary, and I do not know whether it has been 
put on any satisfactory basis. 

Comparison of the lengths of the Studentized range intervals with the ¢- 
intervals is complicated by the fact that the lengths of the Tukey intervals de- 


pend on .|c;| whereas the lengths of the ¢ intervals depend on 


k k 


Do De Fislial in. 


il jl 


Scheffé [1] compares the squared length of an F-interval with the squared 
length of the same Studentized range interval, and then considers the maximum 
and minimum values of this ratio over all types of contrast. He points out that 
this squared ratio is a maximum for intervals of the type u;—»,, and a minimum 
for intervals of the type 


2(ur + + + + weya)/k — Q(uasena + + > + + me) /k 


for k even, or of the type 


k k 
Gat ¢aread / (5-1) = Gut +a) / (SF +1) 


for k odd. 

In Table 5 are given values of c?/q*. From this table, for any particular con- 
trast one may compute the squared ratio of lengths for the ¢-intervals and 
Studentized range intervals. For variances equal to a;,07 and covariances zero, 


this squared ratio is 
a[ > Cn: ( y 3s | ¢.: | )] (c*/q?). 
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TABLE 5. VALUES OF c*/q@? FOR 1~—a=.95, .99 
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.87 ‘ 33 .29 .27 72 45 .36 32 .30 
10 1.03 : .40 34 31 .82 51 41 36 34 
50 1.41 , .54 47 43 1.04 65 .52 47 43 

100 1.58 ‘ 61 .53 48 1.14 72 57 51 47 

250 1.80 ‘ .69 .60 55 1.27 .80 .63 57 53 



































In Table 6 are given values of mr, the maximum m such that every ¢ interval 
is shorter than the corresponding Tukey interval, even for the least favorable 
(to the ¢ interval) case, u;—,. A glance at Table 6 shows that mr tends to be 
rather small. If one’s primary interest is in intervals like u;—y,, then he may 
use this table to decide which method to use. Otherwise a comparison must be 
made for each type of interval which is of interest. 

Again, as in the comparison between Scheffé’s intervals and the ¢ intervals, 
the ¢ intervals seem to become better, other things being equal, as 

1) k becomes larger, or 

2) » becomes larger, or 

3) 1—a becomes larger. 


In an analysis of variance situation with a single variable of classification, 
the number of means would tend to be small and primary interest might be in 
estimating the difference between means. Then Tukey’s intervals are perhaps 
preferable. When there are two variables of classification, then one perhaps 
wishes to estimate row differences, column differences, and interactions rather 
than the differences between single means. Then the ¢ intervals are more likely 
to be shorter. 
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TABLE 6. VALUES OF mr, THE MAXIMUM NUMBER OF LINEAR CON- 
TRASTS OF k MEANS FOR WHICH EVERY STUDENTIZED 
RANGE INTERVAL IS LONGER THAN THE COR- 
RESPONDING t+INTERVAL, 1—a=.95, .99 









































\ v l1—a=.95 l—a=.99 
oe 7 epg aes 7 20 - 
2 0 0 | 0 0 0 0 
3 2 2 2 2 2 2 
4 3 4 | 4 4 4 5 
5 5 6 7 6 7 8 
6 7 9 11 ~ 11 13 
7 9 12 15 10 14 17 
8 i! 16 20 12 18 24 
9 13 20 25 14 23 30 
10 14 24 31 17 28 38 
a- i  - 23 39 19 33 46 
12 | 18 33 17 21 39 54 
13 20 37 53 24 43 63 
14 22 43 | 61 26 50 74 
15 24 18 71 29 56 84 
16 26 53 82 31 64 95 
i. | 28 60 92 33 71 112 
18 30 | 66 103 35 80 | 122 
19 32 | 71 } + IM 37 Bi, 4 >888 
20 | 34 | 77 | 125 39 104 | 157 





5. AN EXAMPLE 


As an example of an experiment in which one may wish to choose among 
these methods, consider the fixed-effect model for a two-way classification with 
a rows, 6 columns, and n observations in each cell. 

Let 2, be the kth observation in the ith row and the jth column. Then 
E (xij) = pg =uta;+b;4+1;, and Var (%;;.) =o?/nfort=1, ---,a;j=1,--°+,b; 
k=1,---,n, with 

b 


Da: = 0, > b; = 0, ~ 1s = 0, j=1,-:-,6, 
i=1 


j=l te=l 


and 
b 
> 14 = 0, t=1,--+-+,4a, 


j=l 


The pooled estimate of the variance, 6*, has n(a—1)(b—1) degrees of freedom. 
Here a;;=1/n, a;;=0 for i+), and if 


6= ; CijMij 
iJ 
is any linear combination of the ab means, the point estimate for it is 


A = Cyliz., 
i,j 
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and the confidence interval using the (-statistic is 


Dd ci¥ss. + c Dd cs/n 6. 
iJ i,j 


In the first column of Table 7 are listed various linear combinations which 
the experimenter may wish to estimate. For ease in comparison, they have been 
multiplied where necessary by a factor chosen so that the lengths of the Tukey 
intervals are all equal. In the second column are listed the usual point estimates. 
Those listed in rows 5 to 10 of the table are linear contrasts, whereas those in 
rows | to 4 are not. In the third column are given the number of each type of 
linear combination; the fourth column gives 


2 
z C4j;/N, 
i,j 


to be used in the confidence interval for that type of contrast. 
The last two columns of Table 7 give 


c i bu ci;/n 
‘J 
for a=3, b=4, n=3 and for a=4, b=5, and n=4. In these columns, c has been 
computed on the assumption that the experimenter wishes to estimate all the 
linear combinations listed, with an overall confidence level of 1—a=.95. To 
compare the ¢-intervals with Tukey’s intervals, the values in columns (5) and 
(6) must be compared with 2.99 and 2.64, respectively, since Tukey’s intervals 
for all these linear combinations are ¢,;%;,.+ (¢/Vn)é. 

To compare the lengths of the ¢-intervals with Scheffé’s intervals, one must 
compare the values of c, 3.97 and 4.31, with 5.11 and 5.92, the corresponding 
values of S. 

It is possible that the experimenter may be interested in estimating not all 
the linear combinations. In Table 8 are shown values of c for various sets of 
linear combinations estimated: rows 1 to 10 inclusive (all the linear combina- 
tions listed) ; rows 1 to 9 inclusive (all the linear combinations listed except the 
differences between means); rows 5 to 10 inclusive (all the linear contrasts 
listed) ; and rows 5 to 9 inclusive (all the linear contrasts listed except the dif- 
ferences between means). Table 8 also gives the corresponding values of S. 

In these particular examples one should probably pick the ¢-statistic over the 
Studentized range. The exception to this (considering only length of interval) 
would be if one is mainly interested in estimates of y.;—y, ;; this does not seem 
likely. It should be emphasized, however, that if the research worker wants 
intervals as short as possible, each problem must be examined separately. 


6. DISCUSSION OF THE STUDENT it METHOD 


It is interesting to consider what may be the actual probability of coverage 
using the Student ¢ method. In (4), we may let the left side (the actual probabil- 
ity of coverage) be denoted by P and the right side (the lower bound for this 
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TABLE 8. COMPARISON BETWEEN LENGTHS OF t-INTERVALS AND 
F-INTERVALS IN TWO-WAY CLASSIFICATION EXAMPLE 
(1 —a =.95) 


a=3, b=4,n=3 s=4,b=5,n=4 











Types 
Estimated Number Number 


Estimated Estimated 8 





1 to 10, incl. 84 ; my 195 ; .92 
lto 9, incl. 48 , : 75 ; .92 
5 to 10, incl. 64 ° 9% 165 ; .78 
5 to 9, incl, 28 . ; 45 3. .78 























probability of coverage) be denoted by P, and consider the difference between 
P and P. 

If all the correlations approach unity, then in the limit all the linear combina- 
tions become one and the same linear combination, and P attains its largest 
possible value. In this case, 


P=P(-c<t<ce)=1- 2f J (Wdt. 


For y= x, P=1—a=.95, and m=100, c=3.48, and P=.9994. 

In [3] I conjectured that P attains its smallest possible value when all the 
correlations are zero. This was established, however, only for m=2 and 3. 

Extensive tables are not available at present to evaluate P when all the cor- 
relations are zero, though Pillai and Ramachandran [5] give 95 per cent points 
and 99 per cent points of the necessary distribution for m less than or equal 
to 8. For v= «, however, the normal tables may be used. For P=1—a, P 
=[1—(a/m)]|", so that P is the first two terms in the binomial expansion 
of [1—(a/m)]™. The difference between them is seen to be bounded by 
[(m —1)/2m]a*, so that for v= © and a small, P is fairly close to the actual 
probability of coverage when all the correlations are zero. In particular, for 
l—a=.95 and m=100, P= (1 —.0005)!°° = .9512. 

Thus for y= « and a small, the inequality (4) gives results which are almost 
as good as any which are attainable when nothing is known about the correla- 
tions. At present I am attempting to construct tables which will give some idea 
of the situation for small values of »v and for correlations between 0 and 1. 
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ON SOME MEASURES OF FOOD MARKETING SERVICES* 


Grorce W. Lapp 
Iowa State University 


Some economists have applied multiple regression analysis to national 
aggregate time series data to measure elasticities of demand for all food 
marketing services. This paper demonstrates the inappropriateness of 
the measures of marketing services price, quantity and value used in the 
analyses. The services price index used was actually a services value 
index; the services quantity index used was an index of food quantity; 
and the measure of value used contained errors of observation of such 
a nature as to lead to biased elasticity estimates. Finally, formulas are 
suggested which might be used for the construction of a Laspeyre’s 
index of services prices and a Paasche index of quantity of services. 


1, INTRODUCTION 


ARMERS and legislators have a long-standing and continuing interest in the 
Prabject of marketing margins for farm produced foods; this interest intensi- 
fies during periods of low or falling farm income such as we have recently ex- 
perienced in this country. In response to demands for information by these 
groups, the U. 8. Department of Agriculture has for years published data on 
the distribution of the consumer’s food dollar between farmers and marketing 
firms ({8] and references cited therein).' 


Recently Marguerite Burk [2], Rex Daly [5] and E. W. Bunkers and Willard 
Cochrane [1] have attempted to estimate consumer demand for food marketing 
services by the use of regression analysis. It is the contention of this paper that 
they have not estimated the demand for food marketing services since they 
have measures of neither the price nor the quantity of marketing services. 


2. MISS BURK’S AND DALY’S MEASURES OF PRICE AND QUANTITY 


Daly defines M = M,-M, where M is the total marketing and processing bill 
for food products, M, is the price of marketing services and J/, is the quantity 
of marketing services. 47, is his dependent variable. He writes, “The marketing 
and processing margin quantity (M,) was computed by dividing the marketing 
bill (M) by the index of the marketing margin for the ‘market basket of farm 
foods’ (M,)” ([5], p. 658, fn. 2). Miss Burk follows substantially the same pro- 
cedure to obtain her dependent variable. She writes, “The total market value 
of all food . . . less the total supplier value [47] can be divided by a price index 
for marketing services between the farm and retail levels of distribution [M,] 
(derived from the marketing margin of the farm market basket). Placing these 
aggregates on a per capita basis and comparing them with the base period 
1947-49 provides a measure of changes in consumption or use of food marketing 
services... ” ([2], p. 909). 

* Journal Paper No. J-3656 of the Iowa Agricultural and Home Economics Experiment Station, Ames, Iowa, 
Project No. 1355, Center for Agricultural and Economic Adjustment cooperating. 

1 The marketing margin is the difference between the retail value and the farm value for a comparable amount 
of product. Included in the marketing margin are all costs and profits or losses incurred by businesses engaged in the 


procersing and marketing of farm produced products. Marketing services are the services performed by these busi- 
nesses, e.g., storage, transportation, packaging, precooking, freezing, sorting, grading, canning. 
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The “total market value of all food” is equal to consumer expenditures on 
food products plus the farm value of products consumed directly in the farm 
home. The market basket series is constructed by pricing a fixed market basket 
of farm food commodities at the farm level, using average farm prices, and at 
the retail level, using average retail prices. The marketing margin is the dif- 
ference between the farm cost and the retail cost. Currently the items in the. 
market basket are the average quantities of farm food products bought per 
urban wage earner and clerical worker family in 1952. The index expresses the 
marketing margin as a percentage of its average 1947-49 value. 

M, is not a measure of the price of a unit of marketing services and M/M, 
is not a measure of the quantity of services. This can be shown with the use of 
the following notation: 


m,= the current marketing margin or marketing bill for one unit of product j, 
a,;=current input-output coefficient=the amount of resource i used to 
market one unit of product j, 
c;=current price of resource 7, 
X;= units of j marketed currently, 
X,;= units of j purchased by urban wage earner and clerical worker families in 
the base period, 
m;=the base period marketing margin or marketing bill for one unit of 
product j, 
d,;= value of input-output coefficient during the base period, 
€;= base period price of resource 7, 
& ;= units of marketing service k performed on one unit of product j cur- 
rently, 
5x; = units of service k performed on one unit of j in the base period, 
pe= current price of one unit of service k, 
px = base period price of one unit of service k. 


mj = ps Cai; = Pi8kj- 
é k 
The total marketing bill for product j is 
Xym; = X; Do emis = Xj Do pads. 
‘ k 
The total current marketing bill (the Burk-Daly M) is 
D Xm; = DD c@sXs = LDL pesesX;. 
I ¢ 4 2 
The Burk-Daly M, is? 
+ Xm; po > Ci Xj 
SA shea (4) 
DL Xm; Dd aX; iSesP 
j Ae j 








According to equation (4), M, is a value index, reflecting changes in the price 





2 This index has been published ((3], p. 34). 
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of services, px, and in the quantity of services, s,;. Alternatively, it reflects 
changes in factor prices, c;, and in input-output coefficients, a;;. 
Dividing, 


u X jm; 2 2X X jPr8kj 


M 





neo X ips (5) 
M, s Xm; 2 Xm rs 5 X jPeSrj ~ ux “aad 


This is Daly’s M,.* Dividing by population gives Miss Burk’s measure of the 
quantity of services. This is not an index of the quantity of marketing services. 
It is an index of the quantity of food weighted by current period margins (and 
multiplied by the constant }-X mi,);i.e., it is a Paasche index of the quantity of 
food.* 

In summary, Daly and Miss Burk have measures of neither the price nor the 
quantity of marketing services. Since their dependent variable is a measure of 
quantity of food, their income elasticities are elasticities for food, not elasticities 
for marketing services. 


3. THE BUNKERS-COCHRANE MEASUREMENTS 


Bunkers and Cochrane [1] have also studied the demand for food marketing 
services, using multiple regression analysis of time series data. Their results are 
of doubtful usefulness because of the nature of their dependent variable and of 
one of their independent variables. 

One of their independent variables is the ratio of the “index of off-farm food 
service prices” ({1], p. 217) to the index of farm food product prices. The price 
of off-farm food services is the same as the price of food marketing services. 
They apparently measure this price in the same way as de Daly and Miss Burk. 
This casts doubt on the accuracy of their estimated income elasticities since the 
incorrect specification of one independent variable, X,, results in biased esti- 
mates of the coefficients of those variables which are correlated with the cor- 
rect form of X, ({6], [7], pp. 214, 327-9). We would expect the true price of 
marketing services to be correlated with consumer income. 

Their dependent variable is also faulty. This variable is “the real value of off- 
farm food services” (V;,) ({1], p. 213). The nature of Vj, can be examined with 
the use of the following notation: 


X;;= units of j currently consumed on farms where grown, 
X;= units of item j currently marketed, 

r;=current retail value of jth product, 

j;=current farm value of jth product, 

R=retail food price index, 

F =index of prices received by farmers for farm food products. 
Ve equals = 





* This argument has ignored the possibility that a slightly different group of products might be included in Xj 
than in X; and hence that the ranges of summation for Xj, ej, pk, aij and cj might be different than the ranges of 
summation for Xj, 54j, Bx, dij and é;, respectively. Admitting this possibility would make no difference in the conclu- 
sions concerning the nature of My and M/Mp. 

¢ Miss Burk read an earlier draft of this paper. She and I are in agreement on many of the shortcomings 
in the available data. She does not, however, axcept my argument that M/M, is a food quantity index [4]. 
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X; 2i FjX3; LIX; + LIX) 
j —|, (6.) 


which, in turn, is equal to 


l l l 1 
ae as £X)+(— LBs ee iX)). 6.b 
R ur a x . R 2s R us ‘ _ 


Combining the first and last terms, and placing the second and third over a 
common denominator yields 


l > xX 5 X;. ~ 
r* i(r; — fi) ea rR * fiz (7) 

The first term in (7) is 1/R times the Burk-Daly M in equation (3). This term 
could reasonably be called the “real value of off-farm food services.” Bunkers 
and Cochrane, however, reserve this name for the whole expression (7). The 
second term in (7) can be considered as an error of measurement in V;,. Bunkers 
and Cochrane used V},/N, and V’,/N, as dependent variables, where V,=num- 
ber of consumer food spending units and N,= number of adult equivalent food 
expenditure units. The errors of observation in V’,/N. and V;,/N, are corre- 
lated with the independent variable—per capita income. The simple correla- 
tions are, respectively, 0.86 and 0.74. The presence of errors of observation in 
the dependent variable leads to biased estimates when the errors are correlated 
with independent variables.® 

Thus their income elasticities have two possible sources of bias: (1) an in- 
accurate measure of an independent variable—price of services—whose true 
value is probably correlated with income—ancther independent variable, and 
(2) an inaccurate measure of the dependent variable in which the measurement 
errors are correlated with income.* The two sources of bias might offset each 
other; they might also cumulate. 


4. ALTERNATIVE MEASURES OF PRICE AND QUANTITY 


Indexes of the quantity and price of services might be constructed as follows. 
Compute the Laspeyre’s price index 


Zz p> X Ser 


} pm X i5esPe ” 





where X; now refers to all food products consumed in this country in the base 
paren. Div — the total marketing bill by this index, yields 





5 Suppose a true model is, in matrix notation, y = X8+u and we have measures of Y =y+e, where e is an error 
of measurement. Let the estimated regression be Y = X8+u-+e. Applying Theil’s method of analysis ((7], pp. 214, 
327-9), expected b =8+-(X'X)-1X’e. 

* This same method of analysis is also applicable to Daly's and Miss Burk’s estimates of income elasticities. 





MEASURES OF FOOD MARKETING SERVICES 
DL LX Xisem 
YR 
, > + X ise 
kj 


Formula (9) is a Paasche index of the quantity of services; it differs from (5) in 
the construction of the price index. The same denominator appears in (8) as in 
(4); however, the numerator of (8) contains 5; whereas the numerator of (4) 
contains s;,;. This small variation makes the difference between a food quantity 
index and a services quantity index. 


LD Xiside. (9) 
yy 
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UNBIASED RATIO ESTIMATORS IN STRATIFIED SAMPLING 


Jose Nieto pe PasevaL 
Iowa State University 


Several unbiased ratio estimators (URE) in stratified random sam- 
pling are presented. The “separate” URE is a direct application of the 
Hartley-Ross URE within strata. The “combined” URE, for equal 
sample size k per stratum, is obtained by means of a complete replace- 
ment, repeated k times, of a sample of size one per stratum. The relative 
efficiencies of these estimators with respect to the usual biased RE are 
discussed. An approximately URE is developed and it is shown that, 
for large samples, it is as efficient as the “combined” biased RE. Finally, 
an approximately unbiased estimator r* of the population “rate” Y/X 
is given, and its efficiency relative to the “rate-of-means” estimator 
4/2 is investigated. 


in 1954. More recently, Goodman and Hartley [3] have studied the pre- 
cision of this estimator for large samples. Robson [7] has given the variance 
of y’ in the case of a finite population by utilizing multivariate polykays. 
Mickey [6] has put forward a general theory for constructing unbiased ratio 
and regression estimators from which 7’ comes out as a special case. These 
results have assumed a simple random sample of n out of N pairs (x, y). The 
present paper deals with some theory of unbiased ratio estimators in stratified 
random sampling. 
The Hartley-Ross estimator of Y, the population mean of the y,’s, is given by 
N-1 n 


y= 7X + ——— —*_ (9 — #8) 
y . N ~pelr ” 


.\ UNBIASED ratio estimator y’ was first presented by Hartley and Ross [5] 


n 
F = (1/n) D> (ys/xs), 
X is the (known) population mean of the z-characteristic, and 9 are sample 
means and N is the population size. The estimator y’ can be expressed also in 
a regression-type fashion, assuming N>>n, as 


l : ae 
y= 9+—— ¥ (yi/2) (X - 25) (2) 
n(n cree 1) inj 


which turns out to be a special case of a more general unbiased regression 
estimator developed by Williams [8]. Equation (2) clearly reveals the un- 
biased nature of y’. 


A. “SEPARATE”? UNBIASED RATIO ESTIMATOR 


Assume that a random sample of m, out of the N, pairs (%ai, ya:) is drawn 
from the A-th stratum (h=12, 2,---, LZ). Let 


L 
n= pe Tp. 


70 
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Clearly, the stratum mean Y, is estimated unbiasedly by 
yx = FXn + [(ra)/(ra — 1)]- (Gn — Fra) (3) 


where it is assumed that N,>>ma. 
It is immediate that an unbiased “‘separate”’ ratio estimator is given by 


L 
ye = > (Ns/N) yk (4) 


h=1 


i.e., a straightforward generalization of y’. 
The variance of y’, for large N, has been given by Goodman and Hartley 
[3]. We may utilize it, therefore, to write the variance of y/ as 


od -: Mi bi. 1 22 2 ‘ 

Var(y/) = >> = yVar(yas — Riza) + —— (1 + pn Jonna} (5) 
hal MpV* ni, — 1 

where R,=E(r,;) and p¢ is the correlation coefficient between the variables r,; 

and z,;. Robson’s result {7] for the variance of y’ can be applied for N, finite 

in polykays form, and it yields (5) for NV, large. 


B. “‘COMBINED’’ UNBIASED RATIO ESTIMATOR 


Let 9. and Z,, be defined by 


L L 
> (Ni/N)in and >> (Ni/N)a:, 
h=1 


h=1 


respectively [2, p. 66]. 

Consider the following scheme: 

Take a sample of one unit per stratum, at random. Compute j,; and Z,, from 
this sample of size L (=number of strata) elements. Replace this sample com- 
pletely and take again one unit per stratum, at random. By means of this repe- 
tition with complete replacement we, in fact, generate an infinite number of 
possible samples, The selection process stops after the k-th repetition. This may 
be visualized as yielding a sample equivalent to a sample of size k from each 
stratum. The total sample size will be kL. 

The above sampling scheme has obvious theoretical advantages for it pro- 
vides us with k independent samples of size L each, which under most practical 
situations will produce samples like those obtained by selection without re- 
placement from populations large enough within strata, when the usual finite 
population correction is neglected. The independence of the samples generated 
will permit the construction of an unbiased combined ratio estimator, since 
now we can estimate its bias. 


Let us call 
l . l k 
Gudi= 55 Ge=Fi5- Je=— VI t= — Vs 
kK ta1 K int 


obtained from the ith sample of size L taken as directed above. 
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Let now 


k l n 


z. (9;/Z:) = > Ti. 


l 
K tnt kK int 


r= 


Now, the term Cov(r;, #;)= Y—RX, where R is the mean value of r; over 
all possible samples of size L, can be estimated unbiasedly by 


1.2 k 
-— u (rs — Fas) (Bs — Bas) = 5— (Sat — Fede). (6) 
Hence, a combined unbiased ratio estimator y/ is given by 
Ye = FuaX + —_ (Soe — Fesksr) 
which can be written also as 
l 


k 
but oe 5/8 )(X — ¥; 
+ on h (9/20 #;) 


Ye -_ Ver 
which clearly shows the unbiased nature of y/ . 
It is also apparent that we may write the variance of y/ immediately as 


=2 2 


l ms l ’ 
Var (y/) = > fo, + Ro,;, — 2R Cov (5;, 2) + aE (1 + piorent (9) 


and since 


we have 


with no covariance terms because of the independence of j; and 9, where 1#7’. 
Similarly, 


l e..9 @ 
Var(y2) = Var(5.. — Ris) + kkoD (1 + pi )oz,or, (10) 


where p{ is the correlation coefficient between r; and Z,. 
Now, the combined biased ratio estimator of Y, proposed by Hansen, 
Hurwitz, and Gurney [4] is given by 


Ve = (Fer/ For) X (11) 


and its approximate variance is given, Cochran [2], for large stratum sizes, by 
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Var(%.) = ~ os » Varig — Qn.) (12) 
ani A 


where Q=(V/X). The above expression can be given as 
Var(9.) = Var(Fa — Qi,.). (13) 


We may use Var(y/ ) + Var(j.:—Rz,.) from (10). This includes up to quad- 
ratic terms and hence is appropriate for a comparison with Var(#,). We then 
have 


Var(f.) — Var(y¢ ) =(@Q — RF) Jos, — 2 Cov(Set, Z1)(Q — R). 


Let now 


_ Cov(Fer, Eu) 


y 


Fas 


be the regression coefficient between §; and 2,. 
We may write now 


Var(#.) — Var(y/) = (Q — R )o:, os 2Boz,(Q — R) 
= 0:,[(Q — B)’ — (R— B)] (14) 


This expression is similar to the ones found by Goodman and Hartley [3]. 
Equation (14) shows that y/ will be more efficient than §, if R is closer to B 
than Q is. In the author’s experience, however, Q appears to be closer to B 
than R, in the majority of practical instances. In those cases §, is more efficient 
than y2. 

A similar comparison can be made with y; and the separate biased ratio 
estimator f, defined by 


L 
t= D> (Ns/N)Sn. (15) 
h=-1 


We would find then that 


fo ” 
ys 


"Tim Leary — Bi)’ —(R,— Bi} (16) 


Var(j,) — Var(y,) = 


with analogous conclusions within strata as those discussed from Equation (14). 

Some special cases may be of interest. 

a) Suppose r, is a constant, say 8. Then 9,,.=8Z,, and then R=Q; hence y/ 
and 9, are identical and, of course, unbiased. 

b) Suppose now that r;=82;, which implies that p/ =1. Then, 7,=82?. We 
have 


R=pX and yf = 5u+ Raut (R/X)(e,, — 0) (17) 
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where 


2 


(ke a ke 
The estimator y/ is still unbiased, since 
Ely!) = ¥ + RX + (R/X)(os, — E(é..)) = VY + RX + (R/X)(-¥*) = 
The exact variance of y/ in this case is easily found to be 


; 2X? , 
Var(y/) = Var( ver - Rz,,) oe SSeS C; (18) 
k(k — 1) 


where C’, denotes the coefficient of variation of Z;. 

If we ‘ignore the last term in (18) we would not be penalized heavily. For 
example, suppose k=5 and C,, =0.80. This last term would be 0.041(RX)?. So, 
even in case of a parabolic re lationship between gj; and Z; that goes through the 
origin, y/ is still unbiased, and its variance may be given by (18) with the last 
term neglected. 

For this particular case, the bias of #. may be studied through the following 
approximation, (see equation (24) below): 


; l 
Bias in 9. = Cov [ (Fer Ee), #,| = ry Cov(r,, %;) 


which may be written as 


l 
Bias in Ve = : pi Or ,08; 


and since r;= 82; and hence p} =1, 


2 


i , mY 
Bias in §. = (8/k)oz, = % C2,. (19) 


If we use the numerical values assumed for C, and k, 0.8 and 5, respectively, 
we would have 


Bias in 7. = 0.128RX 
which may not be negligible. 


NUMERICAL EXAMPLE 


The state of Iowa is divided into nine regions called Crop Reporting Districts 
(CRD), starting in the NW corner and ending in the SE corner of the state. 
Each CRD will be considered a stratum in the present example. 

The characteristic under study will be the acreage under grain corn in Iowa 
in 1956. The sampling unit will be the county. The number of counties per 
stratum ranges from 9 to 12, and the total for the state is 99 counties. The 
auxiliary variable x will be the acreage under grain corn in Iowa in 1955. The 
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data were obtained from the 1955 and 1956 Annual Census of Agriculture for 
the state [la, 1b]. 

A sample of two counties per stratum was drawn, following the scheme of 
complete replacement described in Section B. 

The results of the sample are summarized in the following table. 
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TABLE 1. SAMPLE OF COUNTIES IN IOWA FOR ESTI- 
MATING ACREAGE UNDER GRAIN CORN 


(Data in thousands of acres) 





Acreage under 


Stratum 





11 


The overall mean X is 103.29 th. acres/county 





Counties 
Selected 


Emmet 
Lyon 


Kossuth 
Mitchell 


Fayette 
Winneshiek | 
Calhoun 
Harrison 


Boone 
Tama 


Clinton 
Muscatine 


Adair 
Fremont 





Decatur 
Madison 


Wapello 
Vashington | 


Grain Corn 





1955 (Zni) 


1956 (yas) 


x, 





87 
126 


99 


0 
8 


a 


134.4 


108. 
101. 


122. 
134. 


137. 
101. 


146. 
76. 


61. 


125.: 


46.6 


51.2 


60. 


110.6 


84. 
128. 


129. 


111. 


126. 
75. 





8 
4 


92.: 


105.: 


122.4 


134.2 


58 .6 


97. 








.036 





| 0.924 | 


> 
dé. 





| 





that j..= 99.503 ; Z,,= 102.404; 7,,=0.976. 
The estimates of the mean acreage per county and the total acreage under 
grain corn in the state are 
a) Separate unbiased ratio estimator: 
y: = 100.33 th. acres/county. .. Total acreage (grain corn) = 9933.09 th. a. 


b) Combined unbiased ratio estimator: 


as 


. From the table it is found 


yZ == 99.87 th. acres/county. .. Total acreage = 7 = 9886.72 th. acres 
c) Combined biased ratio estimator: 
5.= 100.36 th. acres/county. .. Total acreage = T= 9936.03 th. acres. 


“ 
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Now the total acreage under grain corn in 1956 was 9877.1 th. acres for the 
state, from the 1956 Annual Census [1b]. 

The estimated variances of these estimators are found to be 

a) Separate unbiased ratio estimator: 


v(ys) = 1.761, 
by using 


-2 


: -- Ni 2 22 2 
vy) = Do 2 { 8yn, + F824, — WaSynen, + 2Gn — Fra) | 
hel MalV “ 


where the term 


(1 + pn Jons,Sra, 
na- ] 


is estimated by 2(9,—7sZ,)*, since n,=2 here, (h=1, 2, - 
b) Combined unbiased ratio estimator: 


2 22 
’ , ie = > oz = 
v(lye) = 35, + et82., — 27854, = 1.760. 


c) Combined biased ratio estimator: 


2 ff tt er e 
v(Fe) = 85,, +> (Fee/ Ze) 8e,, - 2(Har/Ler)85,2,, = 1.780. 


In this example y/ is slightly above 1 per cent more efficient than 7,, and 
the estimate provided by y/ is unbiased. It seems that Q and R are roughly 
equidistant from B in this particular instance. Hence, y/ is preferable here. 

No separate biased ratio was attempted due to the smallness of the sample 
size within strata. 


C. AN APPROXIMATELY UNBIASED RATIO ESTIMATOR 


It is well known [Goodman and Hartley 3, Hartley and Ross 5] that the bias 
in §=(9/Z)X is given by 


—Cov(=, #) - P- XEG/2) (20) 
z 


which cannot be estimated from one sample. 
Let 


s9=(9-—YV)/¥; «5 = V(1 +45) 
and 
é@ = (& — X)/X; . & = X¥(1 + 62). 


It is assumed that | 6z| <1. 
We may obtain the Taylor expansion of the expression 


(9/2) = (Y/X)E(. + 89)() + 82)-?. 
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It can be seen, after expanding and taking expectations in (22), that we may 
write 


 [E(ay.Atz,) — QE(A*z,)] 


ie — 
Cov (2. <e -» (1/k) Cov(r;, Zi) = ~ py 


2 


k 4 4 we. 
+ * [(ay.d" z:) — QE(A x,)] — 3k(k — 1)C(¥ — BX) (23) 


where Ay;,=y:—Y, and B is the population regression coefficient between 4; 
and z,. Hence, 


(24) 


and this relation holds exactly up to quadratic terms in the coefficient of varia- 
tion of #. From (24), an approximately unbiased ratio estimator can be con- 
structed. Since 


1 k 
e| ~~ - a= 9] = 8[— 6 - ro] - coversco, 
we may write 


ar (9 — Fz) (25) 


and we call y{ an approximately unbiased ratio estimator. 
We want now to evaluate the mean square error of y; to compare it with that 
of 9. Let q=(g/#) and Q= Y/X. Then, for N large, 


7 = I P 
E(yd — f)? = X°E(q cae Q)? + | — ; i= ra) | 
(a) (b) 
ox | 4 
ie El(q — Q)(6 — 7a)|. 


(c) 


Evaluation of the above three terms: 
(a) X?E(q—Q)?: By utilizing the definitions given in (21) we have 


Y 
(q—-Q)?= : [°F + 8°% — 26jst — 2(6*F + 8°H5F — 257622) | 


up to terms of third degree. Hence, 


¥E(q—Q)' = V'[C5 + Ci — 29C,C, — 2E(8 = + 8 58% — 2858 2)] (27) 


where C?}=03/Y?, etc., and p indicates the correlation coefficient between y; 
and Zi. 
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b) # y | = : E : (F |. 
(b) | — 0-1 2 Fee 9 —Tt 


Now, 
k a 
e| —— (¥ - | = Cov(r;, 2) = Y — RX. 
Therefore, 


al : ] ly | : } +. LP RX)*. (29) 
? (9 —7F)| = — V: (j — Fz —(Y- 2 (2¢ 
ha peng y r ie ar | (9 rx) ke ) 


Goodman and Hartley [3] have given 


k 
Var | — (9 — ra) | 


LL mcPsaind te A) a al | 
5 4 Pil i ee ee Vi, Li 
-| s ay en i OV 


by using kay-statistics. We may write (29), therefore, as 


l 2 
al (%— r) | 
k— 1] 
(Y — RX)? 


~ B(A" jA’t) . “we [1 —| (30 
riAn) + a 07.0%, = as 30) 


where Ar;=2,;— X, ete. 


2X 
c) — El|(q — Q)(6 — #2)|: 


Before taking expectations we may write 


ig ~ 8G - Mine (= - ) ee 
he Mies ae tae OE. ee § — 72). 
k — | : ‘ k eee Zz - k i 3 of 


Hence, 
2X 2X y? ST 2 ein 
—s El(q — Q)(5 — 72) = — & - BA) | — (Y¥ — RX). (31) 
To evaluate E(j*/Z) we use again the 6-notation defined in (21). Taking 
expectations after expansion of (1+46Z)~', we have up to cubic terms only 


y? 
E(9'/2) = — zl + C5 + Ce — 20CiC; — E(0' + 8 962 — 2698 2)]. (32) 
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Now 
1 
E(57) = YR+ . Cov(y;, ri). (33) 
We have, therefore, substituting the last two expectations in (31), that 


2X , 2Y? 2 2 nt 
ro Fla — Q)(9 — 72)| = ~ -[1 + C5 + C: — 2pC;C: — E(e ) 


sgaine xX 2Y 
— ¥YRX — = Cov(ysr) |= : —(¥ — RX) 


where 
E(¢*) = E(8% + & 76% — 269677). 


Now, from Equation (30), we may write 


E 
| =5 (j — ri) | 
(Y - 


l RX)? 
= zs | Cov(atr, A*xr;) + - k == (= + (k — 1)?+ i) | (36) 


where 


Cov(ri, xi) _ Y - RX 


O2,0r; 02,07, 


The term Cov(A*r;, A*z,) is a high-order moment. It is further divided by 
k®. We may safely neglect this term. If we do, then by substituting the results 
from (27), (34) and (36) in Equation (26) we obtain, after grouping similar 
terms, 


But — T= (— = -)y’ (C3 + C3 — 2pC5C2z) + ~ 
k— k(k — 1) 
= Y°E(¢*) + te wes + (k —1)?+ | (37) 
k-1 ; k3(k 1) 
2X YRX 
a” os eeateedagae' 5 \s 


Let us work with E(¢*) first. From (21) we can easily see that 
E(z* 
Est = bt ~ 3C; — 1. 


In a similar way we find that 


2 E(x 
E(6 982) = - > — Cy — CC, — 1 
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E(3z*) 
E (696 *z) = yr = C; = 2pC5C; — 1. 


27z? 


E(¢%) = om oa) — (C5 + Ce — 2pC,C:). 


elbe + D rir + my tava) 


E(z,) f (k _ ). 
ba + 3 ke 


Tn a similar fashion we find 


2 ; 
) 1 E(yai) k-1! — 1 
= - == + eae pt (Ci, T 4 20C,, Cz ) re. a 


E(y 2 
"mY kt YY k? 


(jz) 1 Ea) k-1 
eee een ea enon ee ae 
YX: kt VX peor d+ wt 


and therefore 


—— 1 (= 4 Yar we l (C2 402 —29C.C.). (45) 
n\o*) = — Bi — ——. aa = Ws ! ‘2, “2 yi z,/- 5 
ke \X:  PeY VR k? 4 


We may evaluate now the term Cov(y,, r;) that appears in (37) 


Cov(y:, r;) = E(yr) — YR = Ely./z,) — YR. (46) 
Let 6x;=(2;—X)/X, when we assume that | 2z,! <1. Also, dy;=(y,—Y)/Y¥ 
2. Elyi/z) = (Y°/R)0 + By: + 6 yi) (1 + bx) 


from which, after expanding and taking expectations, we obtain 
Yf2 


j y 
E(y./z)) = ¥ [1 + Cy, + C2, — 20C,,Ce, — E(8 2; + 8 ydx; — 2ysz,)] (47) 


up to cubic terms. 
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Now, from the definition of 6x; and éy, it can be shown that 
E(®x; + &y 6x; — 25y,622x,) 
ry yati 
i ee te 
Therefore, 
72 


Cov(y:, rx) : "R+ zy (Ci, + C:, — 2C,,C:,) 


2 3 2 2 
Yt, = Qy ati 

E ( — + —_. os). (49) 
a Y?X YX? 


We can now write the mean square for y, , after substitutions for E(¢*) and 
for Cov(y;, r;) are carried out in (37), and after cancellations, as 


- i 2 
Ey — Y)’ = = (Cy, + Ce, — 2pCy,C:,) 


+f ore + (h pe+1] (50) 
R(k—-1) Lp? ; 

Now the mean square for 7=(7/Z) X has been given in (27). Hence, from 
Equations (41) to (44) we may write 
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Pe te ae 
BG - Vt = (14 F)— Cut Ch, ~ Cnn) 


- E{xi( 2} 
— Fay Pied — Qee . 


(1/X) Cov{ xs, (yi — Qx)*} = (1/R)E{a(ys — Qri)*} 


— V'(C), + Ca, — 2pCy,Ce,). (51a) 


Therefore, from (5la) we may write (51) as 


oY 2 a 
E(§ — Y)* = =~ (Cy + Ce, — 20C Cas) — Taz Cov{ xi, (ys — Qui)?}. (52) 
We may write Equation (50) as 


= 2 Y? 2 2 
E(yd ? Y) = eT (Cy, + C;; =r 2pC,,C2,) 
(53) 


l 2 2 2 
ey. 4 — 2k f 
+ BER 5 ee 9)Cz,[1 + (k — 2k + 2)p" | 


where $= X7. 
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[t is cle var from the last two equations that in order to have MSE(y/ ) 
— MSE(%#) <0 we must satisfy 


: 5 2 
(Var 9)C2,[1 +(k —2k+2)p' |< —- ¥ Cov{zi, (yi — Qui)?}. (54) 


¥ 


To satisfy this condition the covariance term in (54) must be negative and 
large. Such would be the case if the fit of the line y;=Qz;+e; improves with 
increasing x;. In Example 1 below the inequality (54) is satisfied for there we 
have k=2, Var $=2.236, C?,=0.224, p’=0.17, ¥ =7/2 and 


Cov{zi, (yi — Qxr)?} = — 3.1025. 
Hence, 


| 2 2 
; (Var 9)C2,[1 + (k — 2k + 2)p"] = (0.50086)(1.34) = 0.67 


—(2/X) Cov{ x, (y; — Qz,)?} = 
EXAMPLE 1 
Goodman and Hartley [3] use a population of four pairs (x, y), where the 


values are (2, 2), (2, 6), (4, 6) and (6, 10), to illustrate several estimators of 


the population mean of the y’s. We have here X =7/2, Y=6 and the sample 








Estimator 





Yi + Ye be 
(y/2)X = [222 ]am 
X? 
Ss i 
@§ + 7, 0-™ 
N-1 1 
-9+-—>-7;-,9"-™ 





=4(y + es 


The gain in efficiency by using y/ instead of 9 is 106 per cent. 

Let a= (2, 2); b= (2, 6); c=(4, 6) and d= (6, 10). The six point estimates of 
the y-mean, utilizing 9 and y/, from all possible samples of size k=2 from the 
(abed) cncmamane are 








Sample 





ab 
ac 
ad 
be 
bd 
ed 














UNBIASED RATIO ESTIMATORS 
From these results we find 


Bias in §= E(¥) — Y= +0.0866. 
Bias in yd = E(y/) — Y = —0.048. 


From (52) and (53) it may be seen that, when the sample size k is large, the 
estimator y/ is about as efficient as 7. Hence, the MSE of y/ should be esti- 
mated by 


MSE(y/) = MSE(9). (54a) 


For small samples, however, it would be worthwhile to examine whether 
condition (54) is satisfied in order to utilize y/ to gain in efficiency over 9. In- 
cidentally, Equation (52) may be used to obtain an estimate of the MSE of 4 
for small samples. 


D, THE ESTIMATOR Yeo IN STRATIFIED SAMPLING 


It is obvious that a separate approximately unbiased ratio estimator may be 
formed by applying y/ within strata. Let it be denoted by y/,. We have then 
LN, ; 


a = a 55 
y dy ue (55) 


, — i l —_ 1 
Yar = (9rn/Fn)Xa + ; (Fa — FrFn) 





Ni | leas 


and its approximate variance may be given by applying (54a) within strata 

Let us now consider a sample of size k drawn from each of L strata by means 
of the complete replacement scheme employed in the derivation of y/ in Section 
B. It is easy to see that an approximately unbiased combined ratio estimator, 
denoted by yi., is given by 


, a w=) l 
Yac = (Fer/ Ear) X + os k bes (Fer we Fo:Ee1). (56) 


We can follow the derivation of (53), with obvious notational changes, and 
write 


E(yse — Y)* = E(Ge — Y)?, (57) 
and we can estimate this expression by 
MSE(y..) = [MSE(5.)]. (58) 


Let us take the example, of counties in Iowa given in Section B. We find that 
yi. = 99.86 th. acres/county, and 7'(y/,) =9886.33 th. acres in the state. 
E. AN APPROXIMATELY UNBIASED RATE. 


A problem frequently encountered in practice is that of estimating the 
proportion or “rate” of two population means, Y and X, from a random sample 
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of k pairs (x, y). The almost universal practice is to estimate the rate Y/X by 
the rate of the sample estimates of Y and X, namely, by 9/2. The bias of 9/z 
is, from (20), 


We could use the estimator y/ developed above to obtain a rate with smaller 
bias than that of 7/Z and hope that it would be more efficient than g/Z. For 
this we can divide y{ by X and consider #/X as approximately unity, which 
may not be too remote an assumption for large samples. The rate thus obtained 
is given by 


(59) 


We may, however, present a derivation of r* that is more palatable to the 
statistician, as followst: 
From equation (24), Cov(#/Z, @)=1/k Cov(ri, 2;). 


Hence, 

5 ie 1 Cov(r:, 

u(2) ET PB 
Eo BR X 


We attempt an estimator in the form of a linear function of 9/Z and 7, as 
follows 


r* = oF + c2(9/2) (60) 
where ¢), ¢2 are constants. Now 


Bir") = (arte) = Y Son ee Cov(ri, ti) 
L(r = (Ci + Ce) = c c eee - snes 
(C1 4 1 2, ¥ 


We impose now the conditions 
C(+e2=1 and + ¢:/k = 0 


so as to have r* unbiased. One easily obtains 
and therefore 


The variance of r* is given by 


1 
Var(e") = 2 |e Var(5/#) + Var(?) — 2k Cov (¢ z *)]. (62) 
zx 





t This derivation, as well as expression (59), is due to Dr. H. O. Hartley. (Unpublished lecture notes, 1954.) 
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To gain in efficiency with respect to the estimator 7/# we must have 
Var(r*) — Var(9/z) < 0 
which leads to 
(2k — 1) Var(#/%) + Var(?) — 2k Cov(#/z, 7) < 0. (63) 
To evaluate the covariance term in (63) we use the approximation [8, p. 57]. 
Cov (2. =) += ———— Cov E - 0) u,z— a | (64) 
“% »v E(u) E(v) E(u) E(v) 
for the covariance of two ratios, in order to obtain 
Cov(5/#, 7) = (1/X) Cov(s — Qz, 7 — R) 
= (1/kX)[Cov(y;, r) — Q Cov(r;, 2] (65) 





where Q= Y/X. 
Now, from (28), Cov(r;, 2;)= Y=RX, and from (49), ignoring cubic terms, 
we have 


Cov(y:, 7) = (Y°/X) — VR + 2(¥'/X)(C), + C2, — 2C,,C.,) 






so that we obtain 


2 
Cov(5/z, 7) = a= (Ci, + C2, — 2pC,,C.,). (66) 


Also, up to second degree terms, the variance of (7/Z) may be given by 
Q 
k 


Var(5/2) (Cy, + C2, — 2pCy,C:,). 







Hence, substituting in (63) we have 
Q? 


7 (Cac + Cay — 20CyCz,) + VarlA) <0 


(2k — 1 — 4k) 











which may be written as 


_— < Var($/2) ‘67 
1} ok ar(7 (67) 






which is an approximate condition to be satisfied if r* is to be more efficient 
than (7/2) as an estimator of Q. 
From (62) and (66) we may write, up to second degree terms, 






Varrt = (0%, + C2, — 290.) 








+ [or — 3Q°(C), + Ce, — 2C,,C.,)]. (68) 


1 
(k — 1)? 
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Thus, for k reasonably large, say k>25, r* is about as efficient as #/Z for 
estimating Q. For small samples we may investigate the condition given in 
(67). Let us take the example given by Goodman and Hartley [3] already 
utilized in Example 1 above. We have there Q=1.7143; Var (7) =0.1825 and 
Var (g/#) =0.0753. Then, from (67), 

0.1825 


——— = 0.0365 < 0.0753 
L?-4 


so that we would expect a gain in efficiency by utilizing r* instead of 9/%. In 
fact, we obtain 





Estimator of Q = (7 /X) | Mean Square Error 





i 0.0309 
(9/2) 0.0749 








The gain in efficiency by using r* over (/Z) is 142 per cent. 


NOTE 


Two recent papers, by Nanjamma, Murthy and Sethi [6b] and by Murthy 
and Nanjamma [6a] give results of a similar nature to the ones presented in 
this paper, although their selection scheme is not with equal probability. The 
first of these papers [6b] gives, among other results, unbiased ratio estimators 
constructed by selecting one out of the N population pairs (x, y) with probabil- 
ity proportional to the z-variable, replacing it and selecting the (n—1) remain- 
ing sample pairs (x, y) with equal probability and with replacement at each 
draw. This procedure is extended to apply to stratified sampling. The second 
paper [6a| derives the rate estimator r* given in (59) by estimating the bias 
in (7/Z%) from interpenetrating samples. 

I am indebted to Mr. J. N. K. Rao for calling my attention to these references. 


ACKNOWLEDGEMENT 


The author wishes to express his gratitude to Dr. H. O. Hartley, who sug- 
gested the problem and gave important advice during the preparation of this 
paper. Also, the writer thanks Dr. W. H. Williams for his comments and for 
many enlightening conversations on the problem of biased and unbiased esti- 
mation, and the referee for several constructive suggestions. 


REFERENCES 

{1} Annual Farm Census. State of Iowa. a. For the year 1955. Bulletin 92-Q. b. For the 
year 1956. Bulletin 92-R. lowa Department of Agriculture. Des Moines, Iowa. 

[2] Cochran, W. G. Sampling Techniques. New York: John Wiley & Sons, Inc. 1953. 

[3] Goodman, L. A. and Hartley, H.O. “The precision of unbiased ratio-type estimators.” 
Journal of the American Statistical Association 53 (1958), 491—508. 

[4] Hansen, M. H., Hurwitz, W. N. and Gurney, M. “Problems and methods in a sample 
survey of business.” Journal of the American Statistical Association 41 (1949), 173-89. 





UNBIASED RATIO ESTIMATORS 87 


[5] Hartley, H. O. and Ross, A. “Unbiased ratio estimators.” Nature 174 (1954), 270-1. 
[6] Mickey, M. R. Some finite population unbiased ratio and regression estimators. 
Journal of the American Statistical Association 54 (1959), 594-612. 
[6a] Murthy, M. N. and Nanjamma, N. S. “Almost unbiased ratio estimates based on 
interpenetrating subsamples.” Sankhyd 21 (1959), 381-92. 
[6b] Nanjamma, N.S., Murthy, M. N., and Sethi, V. K. “Some sampling systems provid- 
ing unbiased ratio estimators.” Sankhyd 21 (1959), 299-314. 
(7] Robson, D. S. “Applications of multivariate polykays to the theory of unbiased ra- 
tio-type estimation.” Journal of the American Statistical Association 52 (1957), 511-22. 
[8] Williams, W. H. “Unbiased regression estimators.” Unpublished Ph.D. Thesis. Iowa 
State University Library, Ames, Iowa. 1958. 








A REPRODUCIBLE METHOD OF COUNTING 
PERSONS OF SPANISH SURNAME 


Ropert W. BuEcHLEY 
California State Department of Public Health 


Counts of “Persons of Spanish Surname” in the Southwestern United 
States were made by the U. 8S. Census in 1950 and are planned for 1960. 
Similar counts from other records can furnish numerators to go with 
denominators from the Census, but difficulties arise in explicit listing 
and definition of “Spanish Surname”. The problem was here approached 
from name counts of maiden surnames of mothers of infants with 
Spanish surnames. A log-normal distribution, fitted to this listing, al- 
lowed estimation of the number of names required for explicit listings 
having various proportions of coverage of all Spanish surnames. Since 
very long lists are required for high per cents of coverage, a 306-name 
list, with seventy per cent coverage, of a California population, is pro- 
posed for both Census and administrative use. Such a list would gen- 
erate accurate rates for that portion of the population covered, even 
though the per cents covered might vary. 


INTRODUCTION 


CCURATE counts of total population and of definable sub-populations are 
A required for a multitude of purposes in today’s complex society. For some 
purposes, counts of the persons within defined geographic areas are required. 
Thus, representation in Congress is based on “ . . . counting the whole number 
of persons in each State excluding Indians not taxed.” [2] For other purposes, 
the definition, delineation and delimitation of subpopulations is more diffi- 
cult. These definitions must, however, be explicit and reproducible, so that 
the characteristics of the defined sub-populations can be accurately measured. 
Demographic studies, including the epidemiologic research in which the writer 
is engaged, require explicit definition and separation of sub-population groups. 
Computation of lung-cancer death rates for the population of Spanish surname 
in California was made difficult by the problem of reproducing, from death 
certificate information, the definition of “Spanish surname” as used in the 1950 
Census of population. [1] The numerators, from vital statistics records, were 
not easily made congruent with the Census denominators. The method de- 
scribed below avoids these difficulties by developing, and proposing as a stand- 
ard, an explicit and reproducible method of counting persons of Spanish sur- 
name. 

BACKGROUND OF THE PROBLEM 


The United States has, in the Censuses since 1930, attempted to count the 
Spanish-named, Spanish-speaking population of Mexican origin within its 
borders. This population, and its members, will be hereinafter referred to by 
a Latin-American label, “Ladino”, to avoid both lengthy circumlocutions and 
attitudinal components. Good reasons exist for the attempt to be made— 
equally good reasons exist why it has not been a complete success. Ladinos are 
a major unassimilated group in the Southwest. Immigrants from the rest of 
the United States to Southwest Texas, New Mexico and Southern Colorado 
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have not imposed their culture on the Ladinos already present in those areas. 
In other Southwestern areas, where the Anglo culture was first to replace the 
Indian or has replaced the Ladino, recent immigrants from Mexico and their 
children continue to be distinguishable. In the whole Southwest, lines are 
drawn, separating the cultural and linguistic group “Ladino” from the cultural 
and linguistic group “Anglo”. The Ladino group furnishes much of the farm 
and mine labor in this area. To the extent that administrative and health prob- 
lems are concentrated in this group, the population of the group must be 
counted in the Census, and must be distinguished and counted for the adequate 
solution of the problems peculiar to the group. 

In 1930, the Census Bureau counted “Mexicans”, as seen by enumerators, 
as a race. Problems arose in the Northern New Mexico-Southern Colorado 
area, where persons in the indigenous Spanish-speaking population consider 
themselves “Spanish-Americans” and reject identification with immigrants 
from Mexico. Problems arose in other areas, too, where Ladinos are rare, and 
thus, do not have high visibility to enumerators. After the Census it was 
pointed out that “Mexican” is legally a nationality, not a race, and, since 
the other difficulties were not minor, “Mexican” is no longer a race for Census 
purposes, and persons called “Mexican” by the “Anglo” community are con- 
sidered as belonging to the white race. 

In 1940, the Census Bureau asked a question about “Language spoken in the 
home in early childhood”. In the Southwestern states, if this language is 
Spanish, a strong presumption may be made of a Spanish or Mexican cultural 
origin. Census tabulations of the characteristics of those white persons who 
reported Spanish as their mother tongue bear out this presumption. This 
method, while valuable sociologically for other nationality and language groups 
as well as Ladinos, has a severe drawback—it is not routinely available from 
administrative records. Numerator data for rate computations would require 
asking the question about language again for eack person at the time of estab- 
lishing the administrative record, a quite impossible task in investigations of 
differing mortality patterns. 

In 1950, a special count of white “Persons of Spanish Surname” was made 
for the five Southwestern states—Texas, New Mexico, Colorado, Arizona, and 
California. This count was made from the Census schedules, which contain the 
names of all Census respondents. For most areas, the results of the tabulations 
of characteristics of “Persons of Spanish Surname” are in reasonable accord 
with the 1930 and 1940 tabulations for “Mexicans” and for “Persons of Span- 
ish Mother Tongue”, respectively. This method has major advantages over 
the previously used methods, as it requires no new questions on the Census 
schedule. Since the surname routinely appears on administrative documents in 
the same form that it appears on Census records, it requires no new or diffi- 
cult-to-get information on administrative documents. That is, the method is 
reproducible. The Census Bureau’s opinion is: 


“In terms of the available evidence, there appears to be no reason for assuming that 
the use of surname to identify the Spanish-American population is any less adequate 





1 The Latinization of Indians, both in Mexico and in the United States, raises some difficult and quite un- 
answerable questions about the validity of this classification. 
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than procedures previously used; and, in some respects, its use may lead to a genuine 
improvement in the quality of the statistics.” [4] 


Certain drawbacks to this method appeared in the writer’s use of these counts. 
These drawbacks can, however, be minimized by revision of the method. 


FORMULATION OF THE PROBLEM 


The problem of counting Ladinos in the United States, as discussed above, 
is not an isolated problem. There are other such problems, in counting racial 
and national groups such as Negroes, foreign-born whites, ete. Why are the 
“Tadinos” so difficult to count that special methods must be used? The answer, 
in brief, is that “Mexican” is a nationality, and the persons classified as Ladinos 
are not necessarily citizens of Mexico. Neither is Ladino a race, to be handled 
as is “Indian”. Steiner’s statement, that “the term ‘Mexican’ has no exact 
anthropological, ethnological, nationalistic or citizenship significance” [3], 
does, however, overstate the difficulties. The Ladino group is definable along 
linguistic and cultural lines. It is definable to the extent that, even though 
U. 8. citizens, persons in the group continue to belong to an in-marrying cul- 
tural group that speaks Spanish and that retains a Latin-American, rather 
than an Anglo-American, culture. This culture group is known and recognized 
by its own members under the term “la raza”, and by the English-speaking 
community under the term “Mexicans”. 

Continued maintenance of culture group identity requires that marriage be 
within the group (endogamy). If endogamy be the rule, the Ladino culture 
group may be defined as “persons having those Spanish surnames typical of 
Spanish-surnamed persons who marry among themselves”. This definition can 
be put into concrete terms, and, in such terms, allows the development of a 
list of Spanish surnames typical of the Ladino culture group. 


METHODS AND MATERIALS 


The 1953 Birth Index for California showed that about 86% of the 24,149 
children who have their fathers’ Spanish surnames, from a preliminary list, 
have mothers who have Spanish maiden surnames. That is to say that endog- 
amy is 86% complete.? Both the mothers and the fathers of these children fit 
the defining category of “Spanish-surnamed persons who marry among them- 
selves”. That is, they belong to the Ladino culture group. They concretely 
embody one part of the definition. The mothers, however, have been selected, 
not for possession of any specific Spanish surname, but only for having married 
within the Ladino group. Their maiden surnames embody the other part of the 
definition, that is, they have “Spanish surnames typical of”. Thus, without ever 
specifying the exact names to be included, a list of names characteristic of the 
Ladino group may be developed. 


TECHNIQUES FOR DEVELOPING THE LIST 


The punch cards used in preparing the 1953 Birth Index were made avail- 
able by the Bureau of Records and Statistics of the California Department of 





* This particular statistic, per cent endogamy, seems to open wide vistas in the analysis of culture group co- 
hesion. For instance, from the 1953 Birth Index, endogamy is but 40% among Portuguese in California, which 
indicates that the “melting pot” is, indeed, absorbing the Portuguese. The Ladino group, by contrast, is not cur- 
rently being absorbed. 
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Public Health. These cards contained child’s surname and given name and 
mother’s maiden surname and given name. The punch cards for all children’s 
names on a preliminary list of 275 Spanish surnames were selected. The 
mothers’ names on these cards were the population for study. 

The punch cards were cleared of duplicate cards on the same child (neces- 
sary for record-keeping purposes because of changes after the original recording 
of the birth). Also removed were the punch cards for non-white children. The 
remaining cards were alphabetized on surname of mother. 


TABLE I. DISTRIBUTION OF MAIDEN SUI.NAMES OF 24,149 MOTHERS 
OF WHITE CHILDREN WITH 275 SPANISH SUR- 
NAMES BORN IN CALIFORNIA, 1953 








Total 





1. Spanish 
a. On the preliminary list of 275 names. 
b & c. On a longer reference list; variants and mis- 
spellings of listed Spanish surnames. 
d. Other definitely Spanish surnames. 
e. Possibly Spanish surname, positively Spanish given | 
name. 


Subtotal, Spanish from name. 
f. Possibly Spanish surname, determined Spanish from 
vital statistics document. 
Total determined to be Spanish or Mexican 








2. Non-Spanish 
a&b. Definitely “Anglo”; other non-Spanish surnames. 
c. Possibly Spanish surname, determined non-Spanish 
from vital statistics document. 
Total determined to be Non-Spanish 





3. Undeterminable 
Possibly Spanish surname, undeterminable from vital 
statistics document. 40 











The mothers’ surnames fall into three basic categories, Spanish, Non- 
Spanish, and Undeterminable. These basic categories can be subdivided, as is 
done in Table I, where the sub-category definitions are given. 

Fortunately, the categories requiring reference to the original document com- 
prised only 304 mothers’ surnames, indicating that the uncertainty in using 
only surnames is of the order of 14%. The information on the birth certificates 
allowed resolution of over 80% of this indeterminate group Three out of four 
were classifiable as Ladino, in category If, and only about one-sixth were still 
indeterminate, in category 3, reducing the uncertainty in the basic population 
of names to less than one-fourth of 1%. 


COMPOSITION OF THE LIST 


Spanish surnames, including variants, for which 10 or more mothers were 
counted in the tabulation of the mothers’ maiden names, are included in the 
list of 306 names given as Table II. As was noted above, spelling variants are 





TABLE II. 306 COMMON SPANISH SURNAMES 
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Rank 


Name 


Rank 


Name 


Rank 


Name 





A 


Abeyta 
Acevedo 
Acosta 
Acuna 
Adame 
Aguilar 
Aguilera 
Aguirre 
Alanis 
Alaniz 
Alarcon 
Almanza 
Alonzo 
Alvarado 
Alvarez 
Amador 
Amaya 
Anaya 
Apodaca 
Aragon 
Aranda 
Arce 
Archuleta 
Arellano 
Arellanes 
Arias 
Armendariz 
Armenta 
Armijo 
Arredondo 
Arreola 
Arriola 
Arroyo 
Arvizu 
Avalos 
Ayala 


B 


Baca 
Banuelos 
Barajas 
Barbosa 
Barela 
Barragan 
Barrera 
Barreras 
Barrios 
Bautista 
Becerra 





Beltran 
Benavides 
Benavidez 
Bernal 
Blanco 
Bonilla 
Bravo 
Bustamante 
Bustos 


Cc 


Caballero 
Cabrera 
Caldera 
Calderon 
Camacho 
Camarillo 
Campos 
Canales 
Cano 
Cantu 
Cardenas 
Carmona 
Carranza 
Carrasco 
Carrillo 
Casas 
Casares 
Casarez 
Casillas 
Castaneda 
Castellanos 
Castellano 
Castillo 
Castro 
Celaya 
Cervantes 
Chacon 
Chavez 
Cisneros 
Contreras 
Cordero 
Cordova 
Corena 
Coronado 
Corral 
Correa 
Cortez 
Cruz 
Cuellar 





227 


152 
131 
63 


95 
300 
1 
116 
304 


Cuevas 
Cueva 


D 


Davila 

De La Cruz 
De La Rosa 
De La Torre 
De Leon 
Delgadillo 
Delgado 
Diaz 
Dominguez 
Dominquez 
Duarte 
Duran 


E 


Encinas 
Enriquez 
Escobar 
Escobedo 
Esparza 
Espinosa 
Espinoza 
Esquibel 
Esquivel 
Estrada 
Estrella 


F 


Fernandez 
Fierro 
Figueroa 
Flores 
Florez 
Fonseca 
Franco 
Fuentes 


G 


Gallindo 
Gallardo 
Gallego 
Gallegos 
Galvan 
Gamboa 
Garcia 
Garza 
Gaxiola 





14 
3 


207 


45 
219 
188 

13 


Gomez 
Gonzales 
Gonzalez 
Griego 
Grijalva 
Guerra 
Guerrero 
Guevara 
Guillen 
Gutierrez 
Gusman 
Guzman 


H 


Heredia 
Hernandez 
Herrera 
Hidalgo 
Hinojosa 
Holguin 
Holquin 
Huerta 
Hurtado 


J 


Jaramillo 
Jimenez 
Jiminez 
Juarez 


L 


Lara 
Leal 
Ledesma 
Leon 
Leyva 
Leyvas 
Limon 
Lira 
Lopez 
Loya 
Lozano 
Lucero 
Lugo 
Lujan 
Luna 


M 


Macias 
Madrid 





Source: Unpublished data, California Department of Public Health. 
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TABLE II—(cont.) 





Rank 


Name 


Rank 


Name 


Rank 


Name 


Rank 


Name 





239 
275 
151 

99 


Madrigal 
Maestas 
Magana 
Maldonado 
Mares 
Marez 
Marin 
Marquez 
Martinez 
Mata 
Medina 
Medrano 
Mejia 
Melendez 
Mendez 
Mendoza 
Mercado 
Mesa 
Meza 
Miranda 
Molina 
Montanes 
Montanez 
Montano 
Montes 
Montez 
Montoya 
Mora 
Morales 
Moralez 
Moreno 
Muniz 
Munoz 
Murillo 


N 
Najera 
Nava 
Navarro 
Negrete 
Nieto 
Noriega 
Nunez 


e) 


Ochoa 
Ojeda 
Oliva 

Olivas 





250 


285 
176 
108 

81 


40 
24 


72 
37 
171 


100 
62 


Olivares 
Olivarez 
Olvera 
Ontiveros 
Ornelas 
Orosco 
Orozco 
Ortega 
Ortiz 


r 


Pacheco 
Padilla 
Palacio 
Palacios 
Parra 
Pena 
Pina 
Peralta 
Perea 
Perez 
Pineda 
Pinedo 
Ponce 
Porras 
Portillo 
Prado 
Preciado 
Prieto 
Pulido 


Q 


Quesada 

Quezada 

Quinones 
Quinonez 
Quintana 
Quintero 

Quiroz 


R 


Ramirez 
Rangel 
Renteria 
Reyes 
Reynosa 
Reynoso 
Rico 
Rincon 





58 
102 
27 
67 
96 
6 


Rios 

Rivas 
Rivera 
Robles 
Rocha 
Rodriguez 
Rodriquez 
Rojas 
Romero 
Romo 
Rosales 
Rosalez 
Rosas 
Rubalcava 
Rubalcaba 
Rubio 
Ruiz 


§ 


Saavedra 
Saenz 
Salas 
Salaz 
Salazar 
Salcido 
Salcedo 
Saldana 
Salinas 
Sanchez 
Sandoval 
Santanas 
Santiago 
Sauceda 
Saucedo 
Sepulveda 
Serna 
Serrano 
Sierra 
Solis 
Soliz 
Sosa 
Sotelo 
Soto 
Suarez 


T 


Tafoya 
Tapia 





149 


11 


Telles 
Tellez 
Torres 
Torrez 
Tovar 
Trejo 
Trevino 
Trujillo 

U 
Uribe 

Vv 
Valdez 
Valadez 
Valencia 
Valenzuela 
Valle 
Valles 
Vallejo 
Varela 
Vargas 
Vasquez 
Vega 
Velasco 
Velasquez 
Velazquez 
Venegas 
Ventura 
Verdugo 
Vigil 
Villa 
Villalobos 
Villalovos 
Villanueva 
Villareal 
Villarreal 
Villegas 


Y 


Yanez 
Ybarra 
Ibarra 


Z 


Zamora 
Zaragoza 
Zavala 
Zepeda 
Zuniga 
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common. In order to accommodate the most common of these variants, the 
revised list allows for possible alternate spellings for some names, especially 
for the alternation of “s” and “z”.’ 

It should be noted here that several names, which had been considered rare 
or atypical in previous coding of Spanish surnames “by ear”, turned out to be 
quite common. A prime example of this class of names is LUJAN, which is 
133rd in frequency, yet which had been ignored. Apparently the rarity of the 
general class of names in -AN, including, in the 306-name list, only BARRA- 
GAN, BELTRAN, DURAN, GALVAN, GUZMAN, and LUJAN; and the 
Spanish “j” pronounced like an English “h”, had combined to give LUJAN 
a non-Spanish aspect to the author. By the explicit formulation of the com- 
position of the list, its inclusion was forced. 

Also to be noted are certain exclusions, namely AVILA, MARTIN, RAMOS 
and SILVA. MARTIN is excluded for the obvious reason that, although the 
named spelled Martin in Spanish is a Spanish surname, the name spelled 
Martin in English is not—and in the United States there are many more 
Anglo MARTINS than Ladino. The other three, AVILA, RAMOS, and 
SILVA are excluded because, in the California material, they are about as 
Portuguese as they are Ladino. 


NUMBER OF, AND ENUMERABILITY OF, ALL SPANISH SURNAMES 


As a consequence of the explicit method used to develop the list of surnames, 
it was possible to make some statistical estimates casting light on the general 
problem. The rankings of names given in Table II may be considered in terms 
of the per cent of the total number of names. Taking the 306 names in Table 
II order and counting the mothers, the two most common names, GARCIA 
and MARTINEZ, make up 6% of the total number of mothers. The first four 
names, adding GONZALEZ and LOPEZ, make up 11%. The number of names 
required to count each additional 5% of the population of mothers up to 70% 
is shown in Table III. Figure 1, which depicts the relationship of the rank of 
the name to the per cent of the population covered, allows an extrapolation 
leading to the estimate that, for 90°% coverage, a list of about 1,900 names 
would be required. For 95%, about 4,500 names would be required, and, for 
99% coverage, about 28,000 names. That is, of course, if the distribution con- 
tinues in the same form. New name coinage and old name reworking would 
lead to other problems before 99% was reached, as would, among other effects, 
the very original spellings made by Anglos who hear Spanish names for the 
first time, VIGIL spelled BEHILL, for example. 


PROPOSED METHOD 


It is proposed that a short list of Spanish surnames be established by the 
U.S. Bureau of the Census, in conjunction with users of such data. Such a list 








+ Except in certain positions, where “s” is the Portuguese and “z” is the Spanish rendition. Rodrigues, for exam- 
ple, is Portuguese; Rodriguez is Spanish. The pairs of variant names were counted as one name in the statistical 
manipulations. For computer programming every variant must be considered as a separate name. One must note 
also, that tildes and accent marks, so important in Spanish orthography, seldom cross the border and never get onto 


punch cards. 
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TABLE III. NUMBER OF SPANISH SURNAMES REQUIRED TO 
COVER SPECIFIC PROPORTIONS OF A LADINO POPULATION 








Cumulative Rank Log Cumulative Rank Per Cent of Names Covered 





2 -3010 
4 -6021 

5 .7782 
10 -0000 
16 .2041 
24 . 3802 
35 .5441 
47 .6721 
64 -8062 
86 -9345 
118 .0719 
159 .2014 
225 .3522 
306 -4857 


we ee 


tw bw 














The above tabulated values indicate that Spanish surnames are distributed in a log-normal fashion. From 
Figure I, the mean may be estimated to be at log cumulative rank 1.9345, and the standard deviation of the log 
cumulative rank may be estimated at about 1.08. 

The 99% point may be estimated as follows: 

99% = +2.33 standard deviations 
1.9345 +(2.33) (1.080) =4.456 
4.456 =log 28,100 


could be used by the Bureau for coding Spanish surnames in the 1960 Census 
as a supplement to total enumeration. It could be made available to those 
users of Census data who wish to code Spanish surnames on their own ma- 
terial. The list, given as Table II, contains the 306 most common Spanish sur- 
names from California data. This list might serve as a starting point in the 
development of such a list by the Census Bureau. This 306-name list is esti- 
mated to cover 70% of the Spanish-named population, but per cent coverage 
could be computed for each area by the Census Bureau, and published as part 
of its total Spanish-name counts. 

The use of such a system, with an established list and specified and pub- 
lished per cents of coverage, would permit valid computation of rates. Users, 
making counts of Spanish-named persons from their own data, would count 
exactly the same persons in the numerator as were counted by the Census in 
the denominator. Thus, any rates computed using these standard techniques 
would be accurate even though based on only a portion of the Ladino popula- 
tion. This feature, of generating accurate rates, would hold even though the 
proportion of the population to whom the rates apply might vary widely from 
the designed per cent of coverage. 


SUMMARY 
The need for adequate Census counts of the Ladino populations, of Mexican 
origin, Spanish-speaking and with Spanish surnames, in the United States has 
prevailed over the difficulties of such counting, and, since 1930, the Census 
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Required Number of Spanish Surnames 
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CUMULATIVE PER CENT OF POPULATION COVERED 


Fria. 1. Number of names required by per cent coverage. 


Bureau has attempted this feat. The 1950 method, counting persons of Span- 
ish surname, has the virtue of being based on generally available information, 
but it also has the defect of requiring implicitly that the coder be very familiar 
with Spanish surnames. Consideration of the distribution of names indicates 
that no list can be exhaustive, so that a proportion of coverage is all that is 
possible on an explicit basis. A list of 306 surnames, found to be typical of the 
86% in-marrying Ladino population of California, is developed on an explicit 
basis. Use of such lists in the Census would enable users not familiar with 
Spanish surnames to code in exact parallel with the Census coding, and thus 
to obtain accurate rates for a specific proportion of this important sub-popula- 
tion. ft 





t The 306- (or 353-) name list (Table II) was used to classify the 1357 California decedents, born in Mexico, 
found in visual search of a listing of January-June, 1960, decedents. Excluding those with non-Spanish surnames, 
the list covered 869 of 1269, or 68.5%. Since the standard deviation in this per cent should be about 1.3%, this re- 
sult is well within hoped-for limits. 
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RESIDUAL ANALYSIS* 
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Virginia Polytechnic Institute 
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Aerojet-General Corporation 
and 
C. W. Ciunigs-Ross 
International Business Machines Corporation 


Individual value of.residuals from a least squares regression equation 
can be analyzed as a set of originally observed quantities. If the anal- 
ysis of residuals is in the form of another regression analysis, the total 
analysis of the data can be described as a two-stage regression. Such 
a two-stage approach is not equivalent to the usual, one-stage analysis, 
and therefore the resulting estimates and tests are biased. It is the 
purpose of this paper to obtain expressions for these biases. 


1, INTRODUCTION 


= the multiple linear regression model 


Y=XB+E 
where 


Y is an (nX1) vector of n observed values of a dependent variable, 

X is an (nXp) matrix of n sets of p independent variables, 

B is a (pX1) vector of regression coefficients, and 

E is an (nX1) vector of errors or residuals which indicate discrepancies be- 
tween the observed Y’s and those explained by the model. 


Least squares procedures can be used to obtain the estimating equation 
Pf = XB, (1.2) 


where YF and B are estimates of Y and B. From this relationship it is possible to 
obtain a vector of estimated residuals 


f = Y — XB. (1.3) 


Standard least squares procedures usualiy do not call for the actual computa- 
tion of the Y vector, but the advent of high-speed computers makes this a rela- 
tively easy task, and in fact, many computer programs now include provisions 
to obtain this vector of residuals. Often such residuals are used only for a cur- 
sory inspection to determine extreme values or the like, but at other times they 
may be subjected to more rigorous analysis. 

It is the purpose of this paper to examine for bias, estimates and associated 





* Eprror’s Nore: This article and the one by Goldberger and Jochems, which follows on p. 105 treat the same 
topic. The paper by Freund, Vail and Clunies-Ross was received first and was being revised when the Goldberger- 
Jochems paper was received. When the similarity was noticed, it seemed to the editor that there were sufficient dif- 
ferences of emphasis and exposition to warrant publishing both. 
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tests obtained by subjecting the residuals to a regression analysis, although the 
results obtained will also apply to the basically equivalent analysis of variance 
and covariance techniques. We shall do this by considering residual analysis as 
a two-stage estimating procedure: 


1. Estimate the original regression coefficients (B, see (1.1) above). 
2. Estimate the coefficients of the regressions of Y on a second set of inde- 
pendent variables. 


Usual regression procedures, on the other hand, imply that both sets of regres- 
sion coefficients should be estimated in one stage. 


2. THE MODEL 


Consider a multiple linear regression model, containing two sets of inde- 
pendent variables, 


Y=XB+ZC+E, (2.1) 


where Y, X, and B are as defined in (1.1); Z and C refer to the variables and 
coefficients of the second set of g independent variables, and £ is the vector of 
errors from this model. 

Using least squares procedures, obtain estimates of B only, ignoring the Z 
variables (i.e., including in the “error” ZC+£). This results in estimates 


y = xB, ; (2.2) 
and the estimated residuals 


y= Y — xB. (2.3) 


The residuals are now used as the dependent variable in a second regression 
resulting in estimates 


p* = 7C*. (2.4) 


In contrast, the usual least squares procedure for estimating the parameters 
of (2.1) would, in one stage, produce estimates, known to be unbiased, which 
are denoted 


f = BX + CZ. (2.5) 


3. CORRECTED VARIABLES REGRESSION 


It is possible to obtain unbiased estimates of the parameters of (2.1) by a two- 
stage procedure somewhat similar to residual analysis. This is done by utilizing 
the Corrected Variables Regression Theorem.' This theorem can be stated as 
follows: 

The least squares estimating equation for (2.1) already given above as (2.6) is 


P = BX + CZ. (3.1) 





1R. L. Wine: “Notes on the Theory of Least Squares,” V.P.I., 1957 (mimeographed), pp. 75-9. Also, R. L. 
Anderson and T. A. Bancroft, Statistical Theory in Research, McGraw-Hill Book Company, Inc., New York, 1952, 
exercise 14.6, p. 187. 
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Using the appropriate formulae associated with the regression of Y on X alone, 
the vector of residuals is 


P= Y— XB. (3.2) 
Using the formulae for B*, 
P= Y — X(X’X)“X’Y 
(I — X'(X’X) 7X") Y (3.3) 
DY, 
where D is an idempotent matrix. Similarly, a matrix of “residuals” from least 
squares regressions of each Z variable on the X variables can be written 
Z = DZ. (3.4) 
If we now perform a least squares regression of Y on Z, resulting in the estimat- 
ing equation 
P = 2, (3.5) 
the following results are obtained: 


A. The least squares estimate ¢ is the unbiased estimate of C obtained as in 
(2.6); thus 


Cul (3.6) 


B. The variance-covariance matrix of the coefficients C is simply 


(2'Z)-. (3.7) 


C. The additional sum of squares due to the regression of Y on Z, adjusted 
for X, is given by the expression 
PP = C(2'2)C, (3.8) 
i.e., the sum of squares due to the regression of 7 on Z. Thus the residual sum 
of squares from the regression of Z and X on Y is 
BE = vy — ¥’y. (3.9) 


It should be noted at this point that corrections on B* have not been made 
to adjust for the regressions of 7 on Z. These can be made with little difficulty, 
but since this paper is a discussion of residuals, i.e., the second stage regression, 
the coefficients relating X to Y will not be discussed. 


4. BIAS DUE TO RESIDUAL ANALYSIS 
In residual analysis 7 is obtained as in (3.3), but the regression of Y is per- 
formed on Z, not Z as in (3.5). This results in the estimating equation 


p+ — 70+. (4.1) 


These estimates are obviously not identical to the unbiased estimates obtained 
from (3.5). Using the definitions of 7 and Z from (3.3) and (3.4) in the least 
squares equation for C, remembering that D is idempotent, we have 
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C = (Z'’DZ)-'Z'DY. (4.2) 
On the other hand, we have for é*, 
€* = (Z'Z)-Z'DY. (4.3) 


Inserting an identity matrix J=(Z'DZ)(Z'DZ)-—', we have 
Ce = (Z'Z)-\(Z' DZ)(Z' DZ)" Z' DY 
= (Z'Z)-(2'DZ)C, 


= (Z’Z)-\(Z'DZ)C. 
This expression can be rewritten, using the definition of D, 
C* = C — [(2'Z)-1Z'X(X'X) XZ] €. (4.5) 


The expression inside the brackets of (4.5) is an expression of the bias of c* 
relative to C and, since C is an unbiased estimate of C, also for the bias of C* 
relative to the population parameter C. 

It can be seen that the bias for any estimated coefficient ¢;*, i=1, 2, - - -, q, 
is a linear function of all the unbiased estimates of the coefficients in the “Z” 
or “residual” set. Further, if the X and Y variables are orthogonal, i.e., if X’Z 
is a null matrix, the bias term will vanish. Unfortunately, other generaliza- 
tions of the bias are of little practical use, except for a special case (see Section 
5). FF: 

The sum of squares due to the regression of Y on Z, denoted (Y*’Y*), are 
calculated in the usual manner: 


(P* P+) = C*7*20+ 


(4.6) 
= Y’DZ(Z'Z)Z'DY. 


Again, this expression is not identical to the “true” additional regression sum 
of squares obtained from (3.8). In fact, using the theorem in the appendix, it 
can be shown that 


pe pec Pp, (4.7) 


Thus, as one would expect, the additional regression sum of squares due to the 
Z variables obtained by residual analysis is less than the “true” additional re- 
gression sum of squares obtained by the simultaneous regression analysis. It fol- 
lows further, that the error sum of squares, obtained by subtraction, 


Ew f* = wry — Pepe < PP. (4.8) 
Thus, the residual variance will be over-estimated by residual analysis. As 
before, if X’Z is a null matrix, the sum of squares are identical. 
The statistic to test the null hypothesis, Ho:C =0 is 
pr pe /q 
F*(qn-p—-Q@)= . 4.9 
(¢,n — p— Q) Ev En — pi—¢ (4.9) 
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The resulting test is, of course, more conservative than indicated by the a- 
level. Thus, if significance is found according to expression (4.9), significance 
will be found if unbiased methods had been used. The power of the test is, of 
course, lower than that derived from unbiased analysis. 


5. A SPECIAL CASE 


The above results simplify greatly for cases when only one variable is used 
in the second stage, i.e., g=1. The expression for the bias of the estimated re- 
gression in (4.5) becomes 


a 2 o 
c* = (1 — Rz:x,,x2,---.x,)6, (5.1) 


where R*z.x,,x,,.--,xp is the multiple correlation between Z and all the X vari- 
ables. Since R? is always between zero and one, it follows that the residual esti- 
mate will always be smaller in absolute value than the unbiased estimate. Fur- 
thermore, R? is, by definition, always larger than any simple r? contained in it, 
i.e., R*z-x,,x,,---.xp i8 greater than the largest of r?z:x,, r?z:x,,.-.,7*z:x». Thus, 
particularly if the magnitudes of the correlations between Z and X variables 
are known, a lower bound is available for the value inside the parentheses of 
(5.1). 

For the sum of squares, a similar simplification follows. The relationship be- 
tween the residual and unbiased sum of squares due to Z, adjusted for X is: 


yer Pe = (1 — Rezx,,x,,--..X,) P'P. (5.2) 


Thus, the bias in the regression sum of squares is also influenced by the value 
of R*z.x,.x,,---,xp- The resulting bias in the value of F, which may be used to 
test the hypothesis that c=0 (see expression (3.7)), also becomes a relatively 
simple expression for this special case. Denote as F* the computed value of F 
obtained by residual analysis and as F the computed value that would be ob- 
tained from the unbiased analysis and it can be shown that 


2 
Rz:x;,X3.++-.X» 





F* = {1 —- \r 
1 cad (i — Rz*:x,,x3,---.x,)(RP:2) 


where R?7.z indicates the true relative reduction in the variation of 7 accom- 
plished by the regression of Z (the “adjusted” Z variable). 
6. APPENDIX 
It is desired to prove: py> pepe where 
i’? = Y’DZ(Z'DZ)—Z'DY 
P" f* = Y'DZ(Z'Z)Z'DY 


Y is an n by 1 column matrix 
X is an n by q matrix of rank g 
Z is an n by p matrix of rank p 
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(X| Z) is an n by (p+q) matrix of rank p+q 
D=I-—X(X'X)-X’ is an n by n symmetric and idempotent matrix. 


The assumption that (Xj Z) is of rank p+gq is reasonable in that it is equiva- 
lent to assuming that none of the independent variables are linearly related. 
It is necessary for the inverse of Z’DZ to exist. 

The result may be proved using the following two theorems. 


Theorem A 


If M is real, A is symmetric and non-negative and M’AM is conformable, 
then 


(a) M’AM is non-negative 
(b) nullity of M’AJ/=column nullity of (M| Ox) where O,4 is a complete, 
linearly independent, null set for A, i.e., 


O4 A = $(a null matrix). 
Theorem B 


(a) If A is symmetric, then there exists a real orthogonal matrix, P, say, such 
that 


P’ AP is diagonal. 


(b) If A is symmetric and non-singular, then there exists a real (possibly not 
orthogonal) matrix Q such that 


Q’AQ = I. 
From Theorem A the following results are obtained (noting that an Op =X): 


(a) Z’Z, Z'DZ are positive definite. 
(b) Z’Z—Z'DZ=Z'X(X'X)'X‘Z is positive semidefinite. 





From Theorems B and A the further result may be obtained: If H, H* are 
positive definite, symmetric, and H —H* is positive semidefinite, then H*— 
— H-' is positive semidefinite. 

(c) There exists Q such that Q’HQ=TI, and Q’H*Q=H,;"*, say, is symmetric. 
(d) There exists P, orthogonal, such that P’H,*P = K where K is diagonal. 
(Theorem B) 


Thus: (QP)’(H —H*)(QP)=I-—K is positive semidefinite; therefore, if {k;} 
are the diagonal elements of K (Theorem A) 


0<ks1 










Therefore, 
(2) 
Thus: 






IV 






r| 






(P’Q)H(QP) = 
(P'Q)H*(QP) = K 
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H = (P'Y’)"“(QP)-? and =H = (QP)(P’Q’) 
H* = (P’Q’)*K(QP)"' and H*~' = (QP)K~*(P’Q’). 
Thus 
H*" = H-' = (QP)(K-' — I)(QP)’, 


but 
(K-!—]) is diagonal with elements { (1/k;)—-1 } which are all >0, (Equation 2), 


i.€., 
(K-! — J) is non-negative. 
Therefore 
H*-' — H- is non-negative. (Theorem A) 
Hence, applying the above result to Z’Z and Z’DZ, 
(Z’DZ) — (Z'Z) is positive semidefinite. 
Hence, by Theorem A, 
(Z’DY)'|(2Z’ DZ)" — (Z'Z)"|(Z’DY) 2 0, 


Pp = f+ P+ (on substituting (1)). 














NOTE ON STEPWISE LEAST SQUARES* 
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A stepwise procedure is often used to estimate one coefficient in a 
multiple linear regression. The relation between this stepwise estimate 
and the estimate obtained by direct multiple regression is spelled out. 


1, INTRODUCTION 


FTER obtaining least-squares estimates of the relationship y=6,71+4u, one 
may plot, or regress, the residuals of this relationship against a new ex- 
planatory variable, x2. What does this residual regression (or “stepwise pro- 
cedure”) tell about the results which would have been obtained if the multiple 
relationship y = 8:41 + 6272+ had been estimated in the first place? In Section 2, 
we show that the stepwise procedure underestimates both the regression co- 
efficient of zz: and the marginal contribution of xz: to the explanation of y, in 
senses to be defined. In Section 3, we explore the connection between the 
marginal contribution of z2 and the significance of its regression coefficient. A 
numerical illustration is presented in Section 4. In Section 5, the results are 
generalized to the case of k+1 explanatory variables. 


2. PROPERTIES OF STEPWISE ESTIMATES 
For convenience, we measure all variables as deviations from their means, 
and write My for >ox:°, Mis for }ox22, My, for >oay, etc. Then 4», the direct 
estimate of 8. which would be obtained from the multiple relationship, is: 

MwM iy 

"Mu 

Miz 

Mi 


§, = 





(2.1) 
Mo —- 


On the other hand, by the stepwise procedure we first obtain b;, the estimate 
of Bi: 





My 
ie 


by 


(2.2) 





Eprror’s Nors: This article and the one by Freund, Vail, and Clunies-Ross which precedes on p. 98 treat the 
same topic. The paper by Freund, Vail, and Clunies-Ross was received first and was being revised when the Gold- 
berger-Jochems paper was received. When the similarity was noticed, it seemed to the editor that there were suffi- 
cient differences of emphasis and exposition to warrant publishing both. 

* This paper was prepared in 1959-60 while the first author was a Fulbright grantee at the Econometric Insti- 
tute of the Netherlands School of Economics. 
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Then we regress the residuals 
yEy- bx, = 


on 22, and take the slope of this regression as 62, the stepwise estimate of Se: 
M»My, 

My — ——— 

M2; Miu 


b, = — 
M 22 M 22 





A comparison of (2.1) with (2.4) shows that 
Mia 
Miu , ) 
M 2 MyM 22 . 


M2 


so that 


bs = 6,(1 — R21), 
where F3., is the squared correlation coefficient between 2; and 22. Now 
(1—R}.,) necessarily lies between zero and one, so that (2.6) implies 
Osi sh if 20 
0226 if <0. 


(2.7) 


That is, the stepwise procedure systematically underestimates the absolute 
value of 5, (unless, of course, x; and x2 are orthogonal, or 6.=0). The underesti- 
mation is proportional to R3.,. 

Further, AR?, the direct estimate of the marginal contribution of x2 to the 
explanation of y (after allowing for z,) which would be obtained from the 
multiple relationship, is 





- 2 2 
AR, I Ry.1,2 —_ 


where F*,; is the squared correlation coefficient of y on 7, and R12 is the 
squared multiple correlation coefficient of y on 2; and 2. 

On the other hand, by the stepwise procedure the squared correlation coeffi- 
cient of 7 on 2 is 


Mi 


‘a (a My, ) 
22 vy M 


11 


MiMi, ) 
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A comparison of (2.9) with (2.8) shows that 


My 
My (ar 22 ~ ) 





Min 
MyM (1 - wa) 








5 om M Mr MuM 
R:., = AR, ——=- AR,- nt 
M (ar “) Ma.M (1 Mw ) 
" i : os Mi ’ sas a MuMy, 
(2.10) 
2 
ee 
_ age aR , 
(er ar 
so that 
R; (1 — R,.:) =. A — R,.)). (2.11) 


Now, it is naturai to take the left-hand side of (2.11) as AR?, the stepwise esti- 
mate of the marginal contribution of x, to the explanation of y. This is so since 
the left-hand side may be read as 


variation of § accounted for by x, _ total variation of 7 





total variation of 7 total variation of y 
Then (2.11) implies 
in @ 31) — &..). (2.12) 


Since 0<(1—R3.,) <1, we conclude that the stepwise procedure systematically 
underestimates the marginal contribution of 22 to the explanation of y (unless, 
of course, 2, and 2 are orthogonal, or AR?=0). Thus, speaking loosely, the role 
of z, is more important than would be indicated by mere inspection of the re- 
sidual regression. 


3. SIGNIFICANCE OF REGRESSION COEFFICIENTS 


A distinct question often arises in the same context, namely: how large must 
the marginal contribution of z2 to the explanation of y be if its coefficient is to 
test out as significantly different from zero? 

Considering firstly the estimates obtained from the multiple relationship, the 
link between the F-test and the é-test of the hypothesis 6,=0 shows that 


bs \2 AR, 
(=) oes ia ae it — 2 (3. 1) 
83 (1 — Rj.1,.2)/T 


where s} is the estimated standard error of 4., and T is the number of degrees of 
freedom (=number of observations minus 3). (See, for example, Mood [2, p. 
206 |.) 

Suppose the desired level of significance requires t2 = ./s3 to be greater than 
or equal to ts. By (3.1), this requirement translates into 


a 2 
AR 
: > te, (3.2) 





(1 — Ry.12)/T 
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2 


“ P t m 
+B = re [1 — (Ry. + 4R,)]. (3.3) 


Ri..), (3.4) 


which means that for zx, to have a “significant coefficient,” its marginal con- 
tribution should be at least ¢{/(t{-+ 7) times the percentage residual variation. 

To obtain the result relevant for the stepwise procedure, multiply both sides 
of (3.4) by (1—R}.,)/(1—R?.:) obtaining the requirement 


2 


Te t 
inte, 
(1— RR?) @+T 


which by (2.10) implies the significance requirement 


(l ~ B,.)), (3.5) 





9 


2 ts 2 
Rs.6 = —— (1 = Reid. 3.6 
2 aye | 2-1) (3.6) 


4. NUMERICAL ILLUSTRATION 


The question considered above arose during the course of a study of produc- 
tion planning made by T. Thonstad and D. B. Jochems [4], and we may il- 
lustrate our results by drawing upon that study. The sample consisted of Busi- 
ness Test data for the German shoe industry—entrepreneurial anticipations, 
realizations, and appraisals, as reported in 36 monthly questionnaires over the 
period 1956-8. A detailed description of the characteristics of Business Test 
data may be found in Theil [3] and Marquardt and Strigel [1]. 

The simple relationship between planned production changes, y, and order 
appraisal, z:, was estimated as 


y= 0.472; Ry = 0.69. (4.1) 


The residuals, 7, from this relationship were scattered against several potential 
explanatory variables, among which was lagged planned production changes, 
denoted by 22. For this scatter the estimated relationship was 


§= 0.1322;  R5.2 = 0.05. (4.2) 


At first glance, the “small” correlation coefficient would appear to discourage 
the introduction of x, as a second explanatory variable. However, in view of the 
fact that R}.,=0.58, application of (3.6) (taking t. at the conventional 2.00) 
showed that zz would in fact appear with a significant coefficient. Numérically : 

2 
ty 


2 
R?.. = 0.05 = 
i “@+7? 


4 
(1 — Raa) = 5 (0.42) = 0.04. (4.3) 
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Indeed, direct estimation of the multiple relationship gave 


y = 0.342, + 0.3022;  Ry-a2 = 0.73 


(0.08) (0.14) (4.4) 


in which it will be seen that 5, exceeds twice its standard error, as required. It 
will also be seen that 


Oe) - Oa wD 


AR} = 0.73 — 0.69 = 0.04 = 0.05-———- = R?..-——__*> 
1 — Ry.1) 


(0.42) om 


in accordance with (2.11), and that 


0.13 bs 
b. = 0.30 = —— = ——_— 
0.42 (1 — Ri.) 


in accordance with (2.6). 


5. GENERALIZATION OF RESULTS 


Our results are readily generalized to the stepwise estimation of the last 
coefficient in y=Bitit - + - + BetetBesiteyitu. Let by: be the estimate of 
Br41 obtained by direct estimation of the full relationship. Let bi, - - - , by be 
the estimates of 6:1, -- - , 8 obtained by estimating the “simple” relationship 
y=Bitit --- +8.tet+u. Let § be the residual from this latter relationship, 
i.e., 


k 
y=y— DL ba. 


fowl 


Then 441, the stepwise estimate of 8;,:, is obtained as the slope of the regres- 
sion of these residuals on the variable 2,41: 


: M 
bey = ——— (5.1) 
M (41) +1) 


The squared correlation coefficient of this residual regression is 


. Ms; 
ess = haa ? (5.2) 
Further, let R?.1,.... be the squared multiple correlation coefficient of y on 
a, * ++, te and let R¥.,..., x41 be the squared multiple correlation coefficient 
of yon 2,°*+, te, tex. Then AR2?=R?,,...24:—R21,...% is the marginal 
contribution of z,4: to the explanation of y. 
As suggested in Section 2, AR?, the stepwise estimate of this marginal con- 
tribution, will be taken as 


AR; = R; -a@¢y(1 — | a * (5.3) 
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Finally, let Ré+1.,....4 be the squared multiple correlation coefficient of 
Te+1 OM BW, °**, Lk. 

With this notation, the results (2.6), (2.11), and (2.12), generalize respec- 
tively to 


bess = bey i(l —_ one (5.4) 


2 


R5-asn(l = Most.«++ a = AR, (1 — OR et (5.5) 


AR, = AR, (1 saa ASP (5.6) 


Since 0S (1—R?@4,).1,....2) $1, the stepwise estimates of the coefficient and of 
the marginal contribution of 2,4; are underestimates in the sense of Section 2. 
The results of Section 3 may be similarly extended to the general case. If t» 
is the critical value of ta41) = ba41»/sefy, then the significance requirement is a 
generalization of (3.4): 
2 
ot lx 2 
AR, = — (1 — Ry.1,..-%) (5.7) 
y + T y ’ 

where T is the number of degrees of freedom (= number of observations minus 
k+2). And in the context of stepwise estimation, the significance requirement 
is a generalization of (3.6): 
2 
tx 


2 
e+T (1 - R e41)-1,-+ +k) 


RFs) 2 


REFERENCES 


[1] Marquardt, W. and Strigel, W., Der Konjunkturtest, Eine neue Methode der Wirt- 
schaftsbeobachtung. Berlin-Miinchen, 1959. 

[2] Mood, A. M., Introduction to the Theory of Statistics. New York, 1951. 

[3] Theil, H., Economic Forecasts and Policy. Amsterdam, 1958. 

[4] Thonstad, T. and Jochems, D. B., “The Influence of Entrepreneurial Appraisals and 
Expectations on Production Planning,” International Economic Review, forthcoming. 





A PARAMETRIC ESTIMATE OF THE STANDARD ERROR 
OF THE SURVIVAL RATE* 


Frep EDERER 
National Cancer Institute 


This paper proposes a simplification of Greenwood’s estimate of the 
standard error of the survival rate. The simplifying assumptions are (a) 
that cases enter the follow-up study at a uniform rate, (b) there are no 
losses to follow-up, and (c) the mortality risk for successive follow-up 
intervals is constant. Numerical examples drawn from cancer registry 
data indicate that the proposed estimate yields acceptable results under 
certain realistic circumstances despite some marked deviations from the 
simplifying assumptions. A table is given for rapid estimation of the 
standard error of the survival rate. 


1. GREENWOOD’S ESTIMATE 


EOPLE who compute survival rates by the life table method commonly cal- 
Pirate the standard error of this rate from a formula developed by Green- 
wood in 1926, ref. [1]. This formula requires a series of computations. For ex- 
ample, calculating the standard error of the 5-year survival rate requires 5 
multiplications, 5 divisions, a summation of 5 terms, the extraction of a square 
root, and a multiplication. Arithmetic examples of this calculation are shown 
in references [2] and [5]. When life tables and survival rates are generated in 
great numbers, as they frequently are with cancer registry data, computing 
standard errors becomes a formidable task. This paper proposes a simplification 
of Greenwood’s formula. 

Consider a chronic disease follow-up study conducted to estimate the propor- 
tion of patients surviving ¢ years after diagnosis. As is usual in such studies, the 
patients do not enter observation simultaneously, but rather one at a time over 
a period of years. To evaluate the survival experience of the patients within a 
reasonable period of time, we close the study n years after its start (n>?), and 
apply the life table, or actuarial, method as specified in ref. [2], which enables 
us to include the available survival information on all the patients, even those 
who entered the study less than ¢ years before the closing date. In computing 
the t-year survival rate, we treat the individuals alive at the closing date but 
under observation for less than ¢ years as having “withdrawn (from observa- 
tion) alive.” We sub-divide the follow-up interval into ¢ years, and assume the 
cases withdrawn alive to have been exposed to the risk of death, on the average, 
for one-half the last follow-up year. Let 


l, =the number alive at the beginning of the kth follow-up year, 
d, =the number of deaths during the kth follow-up year, 
w,=the number withdrawn alive during the kth follow-up year, 


and 


l,’=the effective number exposed to the risk of death during the kth fol- 
low-up year. 





* This paper is based on a master’s thesis completed at American University, 1959. 
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Then we estimate the proportion dying during the kth follow-up year as 

e = d,/l , 
where |,’ =1,— 4wx, and estimate the ¢-year survival rate as 


P, = |] &, k=1,2,---,4, 


k=l 


where J, = 1 —,. In this notation the variance estimate of P, derived by Green- 
wood [1] and re-derived by Irwin [3] and Kaplan and Meier [4] is 


t 
-~ a2 . 
V(P,) = P, » Gu/ld pr. 
k=l 
The square root of this expression has been recommended as an estimate of the 
standard error of the survival rate [references 2, 5, 6]. We shall refer to it as 
“Greenwood’s estimate,” and designate it with the symbol sg. 


2. THE PROPOSED SIMPLIFICATION 


We shall simplify Greenwood’s estimate under three assumptions: . 

(a) Cases enter the follow-up study at a constant rate. Let a;=the number of 
cases entering during the ith year of the study, and n= the number of years dur- 
ing which cases enter the study. We shall assume that a;=a fori=1,2,---,n. 

(b) There are no losses to follow-up. 

(c) The mortality risk for successive years of follow-up is constant. Let q.=the 
mortality risk for the kth follow-up year, and p,=1—q,. We shall assume that 
@=q, Pe=P?'', and Pp=P,*'' for k=1, 2,---, t. 

The individuals entering during the first (n—?) years of the study, i.e., the 
first (n—t) cohorts, will exhibit w,=0, and hence lf =1, for k=1,2, - - - , t. The 
total number of cases entering during the first (n—t) years is (n—t)a, and we 
obtain successive values of I from lf =l,=lx_1px_s. Thus, for the first (n—?) 
cohorts combined, 


Li = (n — thaP,_, (1) 


for k=1, 2, - - - , t, with Po defined as 1. 

Turning to the last ¢ cohorts, we consider the jth of these, the (n—t+j)th 
cohort. No cases withdraw alive from this cohort for the first (t—j) years of 
follow-up; hence 


Y=-k,= aPy1 


fork=1, 2, -- +, (t—j). For the last year of follow-up, Uf —j41=U+_-j41— we_j41/2, 
and since all cases alive at the beginning of the last follow-up year either die or 
withdraw alive during that year, J¢_j41=@P:_;=wr_j41+ds_541. Applying the 
mortality risk to the effective number exposed to the risk of dying 

di ji = Qli-s4i, 


we have, for the last year of follow-up of the (n—t+ )th cohort 
li-j41 = @Py5/(1 + p). 
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Adding the sum of the lf values for the last ¢ cohorts combined 





X le = (t — WaPr1 + aPxa/(1 + p) 


j=l 






to the sum for the first (n—?¢) cohorts combined shown in equation (1) yields 
li for the entire n cohorts combined, 


Li = aPys[(n — k) + 1/(1 + p)]. 


From this we obtain the following estimate, designated by sz, for the standard 
error of the f-year survival rate: 







| Pig(1 + p) ua 
sy = } _eeeee P 


| d Pl + pr — b) + 1] 










Let 


ee nPiq(1 + p) 7188 


> P,.{[( + p)(n — k) + 1] 


kel 















Then, since |,=na, 






sy >= (H/lh)*”. 









The Appendix-Table shows values of H corresponding to values of P; and n.' 
From these it is possible to obtain sy by dividing by the number of cases in the 
study and taking the square root. From the H-values shown in the Appendix, 
tables of twice the standard error for t=1, 3, and 5 and /, ranging from 50 to 
2,000 have been published elsewhere, ref. [9]. 










3. EVALUATION OF THE RESULTS 





We shall consider the three simplifying assumptions of the preceding section. 
(a) Cases enter the follow-up study at a constant rate. This assumption is often 
reasonable for chronic diseases that, during the study period, exhibited no 
marked changes in incidence. The assumption may not hold for a hospital that 
has added or lost sources of patients or a central registry that has increased or 
decreased the number of reporting hospitals during the study period. 

(b) There are no losses to follow-up. Studies without any losses to follow-up 
are exceptional. However, if a follow-up study is to be worth-while, the propor- 
tion of lost cases must be kept very low, because the survival experience of the 
lost cases may be quite different from those remaining under observation, 
ref. [7]. Moreover, it has been shown that even when losses to follow-up are as 
high as 10 per cent, the effect on Greenwood’s estimate is small, ref. [8]. 

(c) The mortality risk for successive follow-up intervals is constant. This is 
usually not the case in chronic diseases, particularly not in cancer, where the 























1 The H-values were computed on an electronic computer. The author is indebted to Elliot Cramer for writing 
the program and carrying out the computations. 
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mortality rate for the first follow-up year will often be two or three times as 
high as for any subsequent year. 


Tables 1 and 2 show a number of examples of estimates of standard errors of 
5-year survival rates for cancer patients from Greenwood’s formula and from 
the H-values.* In Table 1, the cases entered observation over a 5-year period 


TABLE 1. ESTIMATES OF THE STANDARD ERROR OF THE 5-YEAR 
SURVIVAL RATE FOR CASES DIAGNOSED OVER A 5-YEAR 
PERIOD COMPUTED BY GREENWOOD’S FOR- 

MULA AND BY THE H-VALUE METHOD 
(The survival data are from the Connecticut Tumor Registry: 


cancer cases diagnosed 1947-51) 











Number | 5-year sur- 
of cases | vival rate |Greenwood’s|} H-value | Difference 

(1) (2) method method {(3)—(4}] 
(3) (4) (5) 





Males 
All sites 10 ,884 25 .008 .009 .001 
Buccal cavity 791 .35 .038 .034 .004 
Bladder 573 .28 .035 .039 .004 
Lung | 1,011 .03 .012 .012 .000 
Digestive system 3,605 13 .012 .012 .000 
Skin | 1,549 .50 .028 .025 .003 


| Estimate of standard error 
| 
} 
| 
| 


Females 
All sites 11,943 .38 .008 .009 .001 
Buccal cavity 162 44 .084 .078 .006 
Breast 2,898 46 .020 .018 .002 
Cervix Rey .53 .024 .029 .005 
Corpus 625 .62 .042 .037 .005 
Uterus, NOS | 364 .50 .038 .052 O14 
Ovary 629 | .24 .024 .035 O11 
Skin 1,178 .68 .035 .026 .009 
Endocrine glands 120 | .53 .079 .090 O11 




















(n=t), and in Table 2 over a 10-year period (n>1t). The differences between the 
two sets of results are small enough so that, if the estimates are used to assess 
the reliability of the computed survival rates, either set will yield essentially the 
same conclusions. The differences in Table 2 are particularly small. 

The two sets of results compare closely despite the fact that the data used in 
these calculations do not fit the assumptions underlying the H-value method 
particularly well. About 7 per cent of the cases diagnosed in Connecticut, 1947- 
1951, were lost during the first five years of follow-up. For cancer of all sites in 
males, the number of cases diagnosed increased from 2,411 in 1947 to 2,788 in 
1951,° and the successive proportions dying during the first five years of follow- 





* The author is grateful to the Connecticut Tumor Registry for‘ making the data in Tables 1 and 2 available. 

* Source: Griswold, M. H., Wilder, C. 8., Cutler, 8. J., and Pollack, E. S., “Cancer in Connecticut 1935-51.” 
Hartford: Connecticut State Department of Health, 1955. Cases obtained from death certificates only (comprising, 
in males, 18.8 per cent of cases of cancer of all sites, and 25.2 per cent of the lung cancer cases) are included here, but 
excluded from the data in Table 1. 
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TABLE 2. ESTIMATES OF THE STANDARD ERROR OF THE 5-YEAR 
SURVIVAL RATE FOR CASES DIAGNOSED OVER A 10-YEAR 
PERIOD COMPUTED BY GREENWOOD’S FORMULA 
AND BY THE H-VALUE METHOD 


(The survival data are from the Connecticut Tumor Registry: 
cancer cases diagnosed 1945-54) 


















Estimate of standard error 





Number | 5-year sur- 
of cases | vival rate | Greenwood’s| H-value | Difference 

(1) (2) method method ((3)-(4)] 
(3) (4) (5) 














Breast 


Surgery only 2,348 .70 .012 O11 .001 
Radiation only 21 .16 .086 .098 — .012 
Surgery and radiation 286 .63 .033 .034 — .001 











Cervix 


Surgery only 469 .89 .021 .017 .004 
Radiation only 853 56 .020 .020 .000 
Surgery and radiation 161 72 .042 .042 .000 











Corpus 

Surgery only 444 .84 .022 .021 .001 
Radiation only 157 46 .050 .048 .002 
Surgery and radiation 331 Fe i .029 .027 .002 






























Ovary 


Surgery only 264 .68 .034 .034 .000 
Radiation only 7 22 .192 .193 — .001 

















Surgery and radiation 83 .52 .064 .066 — .002 











up were .502, .215, .163, .148, and .117. For cancer of certain specific sites, the 
variation in the number of new cases entering the study and in successive an- 
nual mortality risks was even more marked. Thus, for cancer of the lung in 
males, the number of cases diagnosed increased from 234 in 1947 to 345 in 1951, 
an increase of nearly 50 per cent. Also the successive proportions of male lung 
cancer cases dying during the first five follow-up years varied markedly: .820, 
.536, .454, .000, and .250. 

In addition to the H-values shown in the Appendix, H-values have been com- 
puted under the alternate assumptions that mortality rates for successive years 
of follow-up decline at a rate similar to that for cancer, and that the rate at 
which cases enter the study increases 5 per cent per year [8]. Despite the dif- 
ferences in assumptions, estimates of standard errors from the various H-values 
are not substantially different from one another. This indicates that it is not 
necessary for the empirical data to fit the assumptions underlying the H-values 
tabled in the Appendix very closely in order to obtain reasonably good estimates 
of the standard error. The reader is cautioned in applying the H-value method 
in circumstances where death rates change more markedly with time than in 
cancer or where a radical change in the rate of entry has occurred during the 
study period, such as a marked change in the number of sources of patients. 
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APPENDIX 
APPENDIX TABLE VALUES OF H 





t-Year survival rate (P:) 





35 .40 .45 .50 .55 .60 






































.20 .22 
24 .27 
30 .33 
41 .45 
-80 .88 














-25 


.32 
.37 
-45 


-60 
1.11 





n=8 





9 . ‘ -25 .26 .26 . -25 .24 
-22 -26 .28 .28 .29 .28 .28 .26 
-29 .31 .32 .32 .31 .30 .29 

. ‘ -34 .35 .36 .35 .34 .32 
35 .38 .40 .41 .41 .40 .39 .37 


39 .43 .46 .48 .49 .49 .48 .46 .44 
.52 .57 .61 .63 .64 .64 .63 .60 .56 
-99 1.08 1.13 1.17 1.18 1.16 1.13 1.08 1.01 


SES §8aa8 
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APPENDIX TABLE—(cont.) 





t-Year survival rate (P;) 





05 .10 .15 .20 .25 .30 .35 .40 .45 .50 .55 .60 .65 .70 .75 .80 .85 .90 .95 





n=9 





05 .09 .13 .16 .19 .22 .23 .25 .26 .26 .26 .25 .24 .22 .20 .17 18 .10 .06 
05 .10 .14 .18 .21 .24 .26 .27 .28 .28 .28 .27 .26 .24 .21 .18 .14 .10 .06 
. -11 .16 .20 .23 .26 .28 .30 .30 .31 .30 .29 .28 .26 .23 .20 .16 .11 .06 
07 .13 .18 .22 .26 .29 .31 .33 .34 .34 .33 .32 .30 .28 .26 .21 .17 .12 .06 
08 .15 .20 .25 .29 .33 .35 .37 .38 .38 .37 .36 .34 .31 .28 .24 .19 .13 .07 


arwnds = 
& 


6/| .09 .17 .24 .30 .34 .38 .40 .42 .43 .43 .42 .41 .39 .35 .32 .27 .21 .15 .08 
7] .12 .22 .29 .36 .41 .46 .49 .51 .52 .52 .51 .49 .46 .42 .37 .32 .25 .18 .09 
8 17 .30 .40 .48 .55 .60 .64 .67 .68 .67 .66 .63 .59 .54 .48 .40 .32 .22 .12 
9 33.58 .77 .92 1.04 1.12 1.18 1.21 1.22 1.21 1.17 1.12 1.04 .95 .83 .70 .55 .38 .20 





n=10 





-05 .09 .13 .16 .19 .22 .23 .25 .26 .26 .26 .25 .24 .22 .20 .17 .13 .09 .05 
-10 .14 .18 .21 .23 4.25 «4.27 «6.28 «1.28 «628 «O27 «626 «28 (C21 CWS CLK C«#dO 
-ll .16 .20 .23 .25 .28 .29 .30 .30 .30 .29 .27 .25 .22 .19 .15 .11 
-32 .33 





aronbd 
SSes 
S8Ss 








SS883 

















09 .13 .16 .19 .21 .23 .24 .26 .26 .26 .26 .24 .22 .19 .17 .18 .00 
10 .14 .17 .20 .28 .26 .26 .27 .27 .27 .26 .26 .23 .20 .17 .14 .10 
‘ 16 .18 .22 .24 .26 .28 .28 .29 .28 .27 .26 .24 .21 .18 .4 .10 
11 .16 .20 .23 .26 .28 .29 .380 .80 .30 .29 .27 .26 .22 .19 .15 .I1 
12 .17 .21 .26 .28 .30 .31 .32 .82 .82 .31 .20 .27 .24 .20 .16 11 


om tem | 
SSBR8 
S 


07 .13 .18 .23 .27 .30 .82 .34 .34 .84 .34 .33 .31 .29 .26 .22 117 «£12 
08 .14 .20 .26 .29 .32 .35 .36 .37 .37 .387 .35 .33 .31 .27 .23 .18 .18 
‘ -22 .28 .32 .35 .388 .40 .41 .41 .40 .39 .36 .33 .30 .25 .20 .14 
-10 .18 .25 .31 .36 .40 .42 .44 .45 .45 .44 .43 .40 .37 .33 .28 .22 .15 
-12 .22 .29 .36 .41 .45 .48 .50 .61 .51 .50 


oc ono 
é 
S888 22888 


— 












n=15 












05 .09 .13 .16 .19 .21 .23 .24 .25 .26 .25 .25 .23 .22 .19 .16 .13 .09 
06 .10 .14 .17 .20 .22 .24 .26 .26 .27 .27 .26 .24 .22 .20 .17 .14 .10 
-10 .14 .18 .21 .24 .26 .27 .28 .28 .28 .27 .26 .24 .21 .18 .14 .10 
11 .16 .19 .22 .26 .27 .28 .29 .29 .29 .28 .27 .26 .223 .19 .16 .10 
31 .31 .30 .28 .26 .23 .20 .16 .11 













238 
88288 








(Continued on page 118) 
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APPENDIX TABLE—(cont.) 





t-Year survival rate (P;) 





-55 .60 





32 31 
-34 = .33 
37 .35 
39 .38 














When confronted with data that deviate radically from the assumptions, it is 
suggested that the reader test several H-value estimates against results ob- 
tained by Greenwood’s formula, to determine whether the H-value estimates 
are tolerable before applying them on a large-scale basis. 
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A NOTE ON FOLLOW-UP FOR SURVIVAL IN THE 
PRESENCE OF MOVEMENT* 


D. J. Toompson anv D. Kop1iin 
Graduate School of Public Health, University of Pittsburgh 


A common disturbance in follow-up surveys is the loss due to move- 
ment, a loss which makes it impossible to estimate in a straightforward 
fashion the frequency of events of interest irrespective of movement. 

The error committed by deriving such an estimate from the non- 
movers is examined in terms of the movement rate and differences of 
the event rates between movers and non-movers for a five- and ten- 
year follow-up. Whether the “event frequency irrespective of move- 
ment” should be used as a measure of risk, depends on epidemiological 
hypotheses. 


1. INTRODUCTION 


A common disturbance in follow-up surveys is the loss due to movement, 
a loss which makes it impossible to estimate in a straightforward fashion the 
frequency of events of interest. If, for instance, death is such an event, then 
the number of “deaths after movement” remains unknown, and the observa- 
tions are incomplete. A problem of this kind, sometimes referred to as “com- 
peting risk” (Neyman [5], Cornfield [1]), may occur, of course, with losses 
other than by movement. A case, for instance, which has drawn much attention 
recently ({2], [3], [4]) is that of follow-up of patients admitted to therapy at 
various points in time. On a common closing date patients will therefore vary 
in observation time; thus a patient admitted at the beginning of the follow-up 
period constitutes a complete observation while a patient admitted at n time 
units before closing time is an incomplete observation “lost” after n time units. 
There is, however, one important distinction between this case and the move- 
ment case: while the assumption of independence of death and “loss” is quite 
reasonable in the former (and thus made by the authors dealing with it) it is 
easy to conceive that the mortality of movers may be quite different from that 
of non-movers. Thus, it seems to us, the case of movement merits separate at- 
tention and in this note the possible influences of movement losses on survival 
estimates for the dependent case will be our main concern. 


2. A MODEL FOR SURVIVAL AND MOVEMENT 


Let us represent follow-up for survival in the presence of movement by the 
following model (Fig. 1): The total follow-up time is subdivided into n periods. 
At the end of the first period the group being followed is divided into four 
classes: dead and surviving movers [(m, d), (m,d)] and dead and surviving 
non-movers [(#, d), (m, d)| with the corresponding joint probabilities + (1) 
as indicated in Figure 1. The marginal probability of dying in the first period is 
p, and of moving is ri. 

The surviving movers (their relative frequency being 72 (1)) are subse- 
quently, in the second period, subject to a new mortality pS” and, if they do not 





* Read before the Meeting of the Biometric Society, ENAR, Pittsburgh, March 19-21, 1959. 
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Fra. 1. A model for survival and movement. d=death, d =survival, m =movement, 
wm =no movement. Three consecutive time periods considered. 


succumb, to ps” in the third period, ete., the superscript referring to the period 
in which movement took place. 

Similarly, the surviving non-movers (7 (1)) will have a fourfold split in the 
second period (with probabilities 7 (2)) and the surviving movers (72 (2)) will, 


(2) 


in the third period, be subject to p; 
fourth period, etc. 

Let N be the number present at the beginning of follow-up, d; be the number 
of deaths occurring in the ith period amongst those who entered the ith period 
without previous movement, including cases in which movement in the ith 
period is followed by death in the same period, d be the number of deaths 
occurring in the ith period amongst those who have moved in the jth period. 
Then, ignoring sampling variation, we have 


di 
N 
ds 
N 


and if they do not succumb, to pf? in the 


Pi (1) 
wo(1) pe 
= mo(1)m0(2) pz 


(b) 


wo(1)p2 
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qd) a 
w(1)g2 Ps 


ro(1)2(2)p3 





d 1 
sain % = — [ru(t)o(2){1 — guaran} + o(1)xs(2){1 — grants | 
i 1Ge2 : : 


+ a(I)qo{1 — qgegs }]. (2) 


The right-hand side of (7) is nothing else but the sum of the right-hand sides 
of (1) to (6) as can be seen by multiplying out (7). Now, if p)? =p, for all j, 
that is if the subsequent period mortality is the same for movers and non- 
movers, 


1 
ae [{ wo(1)o(2) + wo(1)m2(2) + wa(1)ga} {1 — gigegs} | 


1 3 
= — [maf — aggs}] =1- IL a = Pi 


1192 t=1 


and, in general, for n periods, 


However, if p? ¥ pi, 


and the error committed, if in this case we would take P* as an estimate of 
P,,, will not only depend on the differences p\” — p,; but also on the movement 
rates m2(i). (Note that for the computation of P* knowledge of p, (or gs=1—p,) 
is necessary, which implies that not only the deaths amongst non-movers are 
known but also the deaths amorgst movers occurring between movement and 
the end of the period. This assumption means that there has been a limited fol- 
low-up on all movers to ascertain whether they have or have not died at the 
end of the period in which they moved.) To get some notion of the magnitude 
of the error for such standard procedures as five- and ten-year follow-up sur- 
veys, we have computed, in Table I, for these time spans the quantity 
(P,—P)/P. for the special case 


pi = op; for all ¢ and 7 


and 
ro(t) = me for all 7, 


choosing reasonable values for a(0.375 to 3), p (0.005 to 0.10) and zz (0.01 to 
0.1). Note that no assumption concerning association in the period 2X2 table 
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is involved. Thus, if for instance the yearly “inside” mortality were p=0.01 
and were 1.5 times as large as the yearly “outside” mortality, then P? would 
overestimate Ps by 3% if the “yearly movement rate” were r:=0.05; after a 
ten-period follow-up the error would amount to 7%. It is, however, worthwhile 
noting that for certain combinations of p and me, the error after 10 periods is 
still surprisingly low, surprisingly, because the fraction lost over the entire 
follow-up period is, under our conditions, 


mo(1 -_ 10) 


ee ek at | 


1 — mo 


a quantity that can be of considerable magnitude (Table II). Thus, with 
p=0.005, r2=0.01 one has a loss of 10% after 10 years, while Table I shows 
that for a=3, the error is only 3%. Note also that after 5-year follow-up and for 
certain values of a, p and mz the error is not larger than 3% although the loss 
rate may be as high as 23%! 

Thus, a certain measure of comfort may be derived from the argument that 
P*, computed from the non-movers, may, even in the face of heavy losses, 
produce a reasonable approximation to the over-all mortality P—although the 
mortality of the movers differed from that of the non-movers. 

The question arises, however, if P is really the measure of choice for all 
follow-up surveys. Consider, for instance, an “epidemiological” investigation 
attempting to assess the effect on mortality of residents, of a postulated en- 
vironmental factor peculiar to a certain geographic location and a certain time. 
A group of “exposed” residents is followed. The movers amongst them enter a 
new environment while the non-movers perhaps continue to be “exposed”; 
even if the factor under investigation was known to have operated but briefly, 


TABLE II. 22(1—2))/(l—e), THE FRACTION LOST OVER 
THE FOLLOW-UP PERIOD (%) 


5-Period Follow-up 








0.005 0.01 





5 5 
22 22 
41 40 





10-Period Follow-up 








0.005 
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its effect on the movers might be confounded with possible effects peculiar to 
the new environment, may the latter be beneficial or detrimental; thus, P 
would fall short of its purpose, namely, to measure the factor effect as ex- 
clusively as possible. P*, on the other hand, would appear to fulfill this purpose 
much better since it gives us the mortality expected had we been able to subject 
the movers to the same mortality risks as the non-movers. 

An entirely different view concerning the relative merits of P and P* would 
have to be taken if one would be inclined toward a different hypothesis con- 
cerning the movers, namely toward the assumption that movers are “se- 
lected” in the sense that they contain a larger fraction of people damaged by 
the factor than do the non-movers. In such a case P rather than P* would 
obviously be the quantity desired. If both mechanisms, “effects of the new en- 
vironment” and “selection,” are postulated to operate, no manipulation of the 
data will permit the isolation of the factor effect! Unfortunately, most surveys 
of the kind indicated will proceed in considerable ignorance concerning the 
mechanism of movement. Sometimes it may be possible to distinguish movers 
from non-movers on the basis of pre-moving characteristics such as sickness. 
Such distinction would support the selection hypothesis and thus, as pointed 
out above, P would be the quantity of interest; if it cannot be obtained, a 
complete follow-up of movers being impractical, P* will be the only quantity 
available and the question how well it approximates P is relevant. On the other 
hand, evidence in favor of the environmental hypothesis, for which P* is ap- 
propriate, can come only from complete follow-up. Most surveys will not be 


able to achieve this; but the user of P* may perhaps console himself with the 
observation that if P* itself is not appropriate on account of a moving mech- 
anism requiring P, it will be, under the conditions outlined, a reasonable ap- 
proximation. 
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THE PROBABILITY OF REVERSAL ASSOCIATED WITH A TEST 
PROCEDURE WHEN DATA ARE INCOMPLETE! 


BERNARD 8S, PASTERNACK AND JuNJIRO Ocawa* 
University of North Carolina 


The main concern of this investigation involves the concept of “the 
probability of reversal.” That is, the determination of the conditional 
probability of not rejecting the null-hypothesis, Ho, if all the data (n 
fixed) are available, given that Ho is rejected by the test of significance 
based on incomplete data (Type 1). Or, on the other hand, the condi- 
tional probability of rejecting Ho if all the data are available, given that 
Hy is not rejected by the test of significance based on incomplete data 
(Type 2). Therefore, “the probability of reversal” indicates the degree of 
stability in the statistical inference procedure when the originally de- 
signed experiment is disrupted by loss of observations. 

The reversal function (of Type 1 or Type 2) of a test procedure is 
defined as either of the above conditional probabilities, respectively, 
when considered as a function H, the alternative hypothesis. Unfor- 
tunately, this type of conditional probability seems to be mathemati- 
cally intractable except for the simplest cases at the present stage. 


1, INTRODUCTION 


_ conduct of any statistical analysis generally involves the reduction of a 


sample, or experimental data, in such manner as to provide as much rele- 
vant information as is possible. Unfortunately, however, statisticians are some- 
times confronted with situations in which data, originally intended to be a 
part of the ensuing statistical analysis, are missing. Clearly, this type of prob- 
lem does not apply to statistically designed experiments that incorporate 
censoring or truncation devices in which observations are likely to be, or are, 
deliberately missed. As far as the authors are aware, current procedures for the 
statistical analysis of non-censored incomplete data involve, essentially, ignor- 
ing the missing values and performing the analysis on the available observations. 
In some cases, however, this type of approach is not justified. When obser- 
vations are missed due to circumstances beyond the control of the experi- 
menter, it might be more realistic to assume the existence of a certain chance 
mechanism according to which observations are missed. One such application 
involving a binomial population has been described by Barnard [1]. Consider 
another example applied to a normal population. 
Suppose that we have taken a random sample of size n, 


Oe ee 


from a normal population N(y, o?). Assume, further, that the probability of 
an observation being missed is p, and that this event is independent of 
21, X2, ++ *, Z, in the stochastic sense. That is to say, we assume a set of mu- 





1 This paper is based on a portion of a dissertation submitted to the Department of Experimental Statistica, 
North Carolina State College, by the first author, in partial fulfillment of the requirements for the Ph.D. degree. 
This research was sponsored in part by the United States Public Health Service and in part by the Office of Ordnance 
Research, United States Army. 

* Now at Nihon University, Tokyo, Japan. 


125 





126 AMERICAN STATISTICAL ASSOCIATION JOURNAL, MARCH 1961 


tually independent and identically distributed chance variables, 


ei 


{ if the ith observation x; is missed, with probability p, (1.1) 


1 otherwise, with probability g = 1 — p. 
The observations actually available to the statistician can be written as 
Yi = TK, #=1,2,---,n. (1.2) 
If k observations are missing (therefore n—k remaining ones are available), we 
will assume, without any loss of generality, that 
2=1 fort = 1,2,---,n—k 
ile fort=n—k+1,:--,n. (1.3) 


In order to test the null-hypothesis Ho: n=0, the statistic ¢* is calculated, i.e., 


pe slend es 
sy 


- 2 a. 2 
$=—Lmw &=—Lw-).- 
N jx NL fimel 
If we denote 
l n—k 
Tat = pe vy 
nm —k bull 


l n—k 

2 

Sn—k Goes > (x; — $a}, 
n—k iat 

then it follows that 


Be k 


t* = V/n—1- 





- —~ tas 
n?(n — k — 1) 


where 


p~remeg ere Sad 
t.= /n—-k—-1- 


Sn—k 
It can easily be seen from (1.6) that as t,.— ~, ¢* is bounded, i.e., 


| ¢*| < ae where (k > 1). (1.8) 


Also for fixed vector 2 (i.e., k), the conditional distribution of t,_, is the é-dis- 
tribution with degrees of freedom (n—k—1). 

It is clear that in order to be able to use the statistic ¢*, at least two obser- 
vations must be available. Since ¢* is an increasing function of t,x, we get for 
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This is not the usual ¢-distribution, but rather a compound of n—2 ¢-distri- 
butions differing with respect to their degrees of freedom. 
If a minimum value of 7 can be found such that 











inf Pu,\ | t*| < r} >1-—aforagivena, (0O<a<1) (1.10) 
O<pSl 


then that value of r should be regarded as the a-point of the statistic ¢*. 

An alternative approach to the problem of missing observations, which 
appears to be of particular importance when the existence of a chance mechanism 
can realistically be ignored, involves the concept of the probability of reversal. 
That is, the determination of the conditional probability (under Ho) of not 
rejecting the null-hypothesis if all the data are available, given that Ho is 
rejected by the test of significance based on the incomplete data (Type 1). Or, 
on the other hand, the conditional probability (under Ho) of rejecting Ho if all 
tha data are available given that Hy is not rejected by the test of significance 
based on the incomplete data (Type 2). Therefore, the probability of reversal 
indicates the degree of stability in the statistical inference procedure when the 
originally designed experiment is disrupted by the loss of observations. 

In a statistically well designed experiment the reliability of final conclusions 
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may be seriously altered from that originally desired, if some observations are 
missing. Knowledge of the probability of reversal of a test procedure, gives the 
experimenter an awareness of the extent to which his decision process may be 
affected by the loss of observations. Probability of reversal, from this point of 
view, can be regarded as descriptive of a test procedure’s performance. Analo- 
gous to a power curve, one can define the reversal function (of Type 1 or Type 2) 
of a test procedure as the probability of reversal (Type 1 or Type 2, respec- 
tively) considered as a function of H, the alternative hypothesis. 

In section 2 the concept of the probability of reversal will be formally intro- 
duced for the case in which an incomplete random sample is drawn from a 
normal population with unknown mean and known variance. If only one obser- 
vation is assumed to be missing, but the variance is unknown, the attempt to 
obtain the probability of reversal (Type 1 or 2) becomes exceedingly difficult. 
This is clearly demonstrated in section 3. 


2. THE CONCEPT OF THE PROBABILITY OF REVERSAL 
Consider a sample of size n 
Z1, Z2, toe, Ze 


drawn from a normal population N(u, o?), where the variance o? is known. 
Suppose that only n—k observations are actually available, 


U1, U2, °° * » Unk (say). 
Let 


a /n — k &* 


Z* = z Li; 2* 


n—k ica oC 


(2.1) 


The current test procedure is to reject the null-hypothesis Ho: u.=0 if 
|2*| >z*(a), and not to reject Ho if |z*| <z2*(a), where 2*(a) is determined by 


1 2*(a) 
Py,{ | 2*| < z*(a)} = —=— exp (—2z*"/2)dz* = 1 — a. (2.2) 
V24r —2*(a) 
However, the original procedure was to have been to reject Ho if |z| >z(a), 
and not to reject H, if | 2| <z(a), where 


(2.3) 


and z(a) is determined by 
z(a) 

exp (—z?/2)dz = 1 — a. 
(a) 


Pu{|2| <2(a)} -—f 


Thus in this special case z*(a) =z(a). 
We shall call the quantity 


Pu,{ | z| < 2(a)] | 2*| > 2*(a)} 





PROBABILITY OF REVERSAL 
the probability of reversal of type 1 under Ho, and the quantity 
Pu,{ | 2| > 2(a)| | 2*| < 2*(a)} 


the probability of reversal of type 2 under Hy. 
Since for any two events, A and B, with 0<P(B) <1, 
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The result given by (2.8) immediately follows. 
Consider the probability of reversal of Type 1 of this test procedure 
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i.e., the joint distribution of z and z* under H, is the bivariate normal distri- 
bution, where 
Cov?(z, 2*) n—k 
o Var(z) Var(z*) n 


Thus the probability of reversal of Type 1 and Type 2 can be computed for 
different. values of 1—p?=k/n, which corresponds to the proportion of the total 
number of observations which are missing. Tables for finding the volumes of 
the normal bivariate surface given in [3] were utilized. 

Table 1 gives the values of the probability of reversal of Type 1 (under Ho) 
for various proportions k/n; and table 2 gives the values of the probability of 
reversal of Type 2 (under H») for various proportions k/n. 

If Pua{|2| <z(a)||z*| >2*(a)}, @ being prefixed, is regarded as a function 
of H, then this represents the functional relation of the probability of reversal 
of Type 1 and the population mean value yz. This can be defined as the reversal 
function of Type 1 of this test procedure. The reversal function of Type 2 can 
be similarly defined. These functions have not as yet been tabulated but the 
authors hope to do so in the near future. 





(2.13) 


3. ONE NORMAL POPULATION—VARIANCE UNKNOWN 


The calculation of the probability of reversal under H, for the case when the 
variance o? is unknown becomes exceedingly difficult even in this simple situa- 
tion as can be seen from the following considerations. 


For simplicity of discussion, we will assume that only one observation, z, 
(say), is missing or has been discarded. 


TABLE 1. PROBABILITY OF REVERSAL OF TYPE 1 (Ho: »=0) 


k/n a = .009322 a = .0455 a =.1096 a=.3173 











.9975 -9900 -9533 -8890 -6818 
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TABLE 2. PROBABILITY OF REVERSAL OF TYPE 2 (Ho: »=0) 
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In this case, the current test procedure is to reject Ho: u=0 if | e*| >r*(a), and 
not to reject Ho if | t*| <r*(a), where 7*(a) is determined by 
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#=— ox, s? = — > (2; — 2)?, t= V/n-1—> 
N tml N inl s 


and r(a) is determined by 
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Put |t| < r(@)j 


I ) t(a) t? —n/2 
ld <scindeipeneniestaa a eiatalesecaiaclia (1 + ——) dit=1—a. (3.2) 


ag r(* -) —r(a) n-1 
/a(n — — 
Vr r 
Denote the probability density, under Ho, of the joint distribution of ¢ and ¢* 
by fu,(t, t*), then 
Pn, \t| > r(a)l || < r*(a)} 
l 


etl f fu,(t, t*)dtat* 
lL — ad \e*\s1"(a) |t|>r(@) 


1 
oft t*)dt R t*)dt*, 
= > a a” | [on 


where fz,(t| *) is the density function of the conditional probability distribution 
of ¢ given ¢* under Ho , and fy,(¢*) is the density function of the probability 
distribution of t* under Ho. 

Now, 


z? 
> rla) = (n— ie} > r*(a), 
§ 


| ¢ 


' 


and since 


ni = (n — 1)#* + 2, 


n— 
ns? = (n — 1)s** + —— (a, — 2*)?, 


in — 1) (= 4 


|t| > r(a) < ———— 
n-—-l1 , 
ena s* 

n 


we have 


The latter inequality is equivalent to 
[zn + (n — 1)2*]’ 


a 4 2 
In — &*)? + ns* 





> r7(a). 


as 
Z* = ’ 


Vn — 2 


we have the inequality 
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| =+ ee e] > r@|(=- ) +n]. (3.6) 
s* /n-2 s*  VYn-2 
The domain of z, for given ¢* and s* is 

8*-a(t*) < a, < 8*-B(*), (3.7) 


where a(t*) and 8(¢*) are given by 


a(i*) = “Tp a We = =| ¢ t* — r(a) fo" hae tataeening 2, c | 


t* 








/n —2 


ahs 


B(t*) = - =| # + (a) fe - i - LY: ‘ee 1) 
~ [r%(a) — 1]}/n — 2 = 
t* 





V/n — 2 


bor Be SPs 
|| > 4/~— Krew - 1. 
n 
Hence (3.3) becomes, by considering the fact that 2, is independent of ¢*, s* 
n—1 
(CH) 
2 
Py,{|t| > r(a)| | t*| <7*(e)} = (l—-a)" Tie eee 
n—2 

/x(n — 2) r(*=) (3.9) 


r*(a) 1 8*B(t*)/o | {*?  \—(n-1)/2 
. Eu {=f exp (—t?/2)dt o (i + ) dt*. 
f x ’ V2r s°a(t")/o n—-2 


7 (a) 


subject to 


Now, (3.9) can be expressed as 


r*(a) B(t*) x“ (n dad 1)s*° t? 
elder) 
i a(t*®) J 0 20? n—1 


r* (a) 


n — 1)s®* 7 -D/2-1 t*? ) \—(a-/2 (n — 1)s** 
{: : | (1 + ) aria] “— |, 
9a? a= 2 20? 
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Therefore, upon integrating out s*’ from (3.10), we have 
ee. 
T? 
2 


Tt . , aeeeeae 
rr?(“—)v@ — 1)(n — 2) 


(3.11) 
r*(a) t*’ —(n—1)/2 B(t*) t? —(n—1)/2 
f (1 +— ;) at*- f (1 +— 7) dt. 
—r* (a) n—2 a(t) n—1 


Thus it is impossible to represent Py, { | t| >r(a) | | t*| <1*(a)} in a compact 
form as a function of r(a@) and r*(a). 
Note, however, that a(t*) =8(t*) when 





Py,{|t| >r(a)| |t*| <*(@)} = (l-a) 


eal } + <a (3.12) 
n—2 


in which case 
Px,{|t| > r(a)| | | < r*(a)} =0. (3.13) 


This suggests that (3.12) represents a stochastic least upper bound for | ¢| 
given ¢* under Ho. This result is shown to be true for this special case where 
n*=n—1 in Ogawa and Pasternack [2]. In that paper, for n*>2 and given 
t*, the minimum value of 7 such that 


Py,{|t| >|} =0, (3.14) 
is obtained. 
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VARIANCE ESTIMATES IN “OPTIMUM” SAMPLE DESIGNS* 


Awan Ross 
University of Kentucky Medical Center 


Formulas describing optimum sample allocation in stratified and 
two-stage sample designs are given ordinarily in terms of “best” esti- 
mates of a population mean or total. The best allocation for estimating 
the variance of the mean may depart from that which is optimum for 
the mean. For stratified sampling, such a departure depends upon var- 
iation among strata in functions of coefficients of kurtosis for the strata 
(fourth moment+squared second moment). For two-stage designs 
optimum sample allocation for estimating the variance of the mean 
depends upon coefficients of kurtosis as well as costs and variance com- 
ponents. Examination of the expressions for optimum variance estima- 
tion and some calculations for specific cases indicate that losses in effi- 
ciency of variance estimates due to using optimum allocation for a 
mean are not extreme in a variety of particular cases. 


1, INTRODUCTION 


HE purpose of this paper is to examine the estimation of variances in sample 

designs where the total sample has been allocated to strata or stages ac- 
cording to procedures for obtaining “optimum” estimates of means or totals. 
As an illustration consider a stratified design with the familiar allocation of 
total sample to the individual strata in proportion to the product of stratum 
size and standard deviation. It is well known that the variance of the usual 
estimate of a population mean or total is minimized for fixed total sample size 
if such an allocation is employed. But what about the estimate of the variance 
of the mean? Does one pay a penalty for optimum mean estimation or is such 
allocation also optimum for the variance? 

Rather than searching for methods to estimate variances for the sake of 
estimating variances, we shall confine ourself to the problem of using standard 
sample designs appropriate for estimating means and inquire how well one 
estimates variance with these designs. This point of view appears to be reason- 
able in so far as the estimates of population means or totals are usually the main 
purpose of a sample survey. However, the growing interest in using sample 
surveys for analytic studies may well necessitate that in the future more 
emphasis be placed on the precision of variance estimation. 

In the course of the discussion to follow certain “optimum” designs for 
variance estimation are developed as reference points, but it should be noted 
that these are not proposals for new survey allocation procedures. They are 
standards against which we shall contrast estimation methods for the usual 
designs in order to more fully understand the characteristics of these designs. 


2, STRATIFIED SAMPLING 


Let the population of values of interest be denoted by Ya; (¢=1, 2,--+, Na 
and h=1, 2, ---, L), where 7 indexes the units within strata and A indexes 





* Presented under a different title at the joint meeting of the Institute of Mathematical Statistics, Biometric 
Society (E.N.A.R.), and American Statistical Association, Pittsburgh, March, 1959. The author gratefully ac- 
knowledges the suggestion of H. O. Hartley that led to the investigation reported here, and the University of Ken- 
tucky Computing Center for use of their facilities in calculating efficiencies given in section 3. 
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the strata. Capitals will stand for population values and lower case letters will 
stand for the corresponding sample quantities. With a random sample of size n, 
from each stratum, a simple estimate of the grand mean of the Y population is 
Ni N A 
> Yhiy 


i = i= DL 


N A Nn, ‘ 


where 


N= >, 


Ignoring the finite multiplier the variance of 9,, is 


and the usual estimate of the variance is 


Nn 
N? 


v(Fa) = i 


8h = DX (ai - jr) 
n- 1 ‘ 

We wish to investigate the effect on v(9,.) of varying the sample size within 
each stratum subject to the constraint }\n,=n for fixed n. This can be carried 
through in a straightfo: «rd manner by writing the variance of v(¥,:) as a 
function of the mn, and minimizing the expression subject to }>m=n. 

Consider the estimator sj. The variance of s is (ignoring the finiteness of 
the population) 


n- 3 4 
Sh, 


n(n, — 1) 


where M, is the fourth population moment about the mean in the Ath stratum 
[2]. Substituting 62, = M/S} we find for large n, the approximation 


4 


2 Sr 
V(s,) = — (Bx — 1), 
Nh 


and for the estimator 9,; 


V[v(5.0)] = > ue 
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By differentiating the expression V [v(j..) ]++-A(n— Dona) with respect to » 
and the ma, setting the partial derivatives equal to zero, and solving for the ma, 
we find that the optimum values for sample sizes for estimating V (4,:), desig- 
nated as ny, are proportional to the product of stratum size, standard deviation, 
and the term (82,—1)™*, Le., 


ny « NSi(Ban — 1)'"4, 


. NN wSa(Bor — 1)*/4 

"SNS — 
nBy 
eh 





where 
B,, = NS. (Ben — 1)!/4 and B= > Bu. 


Now the optimum allocation for estimating the mean by §,: [3, 7] is given by 


Hence optimum allocation for variances is the same as that for means if the 8's 
are the same in all strata. 

Let us compare the variance of v(,:) under the nj allocation against other 
allocations in order to obtain an expression for the efficiency of v(f,,). Let 
j*=9,, under the nf allocation, let V = V [v(J.:) ], and V* = V [v(5*) |. Then 


l 1 
N‘(V pee V*) > Bi( = oe —s) 
mh mM, 


oe 
> Die teter ts (2) 
Ny 


n® 


By “completing the quartic” and identifying ny in terms of (1) we find that 





Be is — may 4(n, — n4)? 2 6(n, — 4)? ) 
nN), nN, Th 


The first factor on the right hand side is V*. Then the rise in V relative to V* 
in terms of optimum and arbitrary sample sizes is given by 


*2 2 * 
V —vy* - . Na — Nn\? 
ee at ales 3{ ——— 3 (3 
V n ni, Nh 
: = whaenw, ‘y 


If the sample is allocated in proportion to the size of the strata (n,=nN;}/N), 
substitution in expression (2) leads to 


N* 1 
NV — V*) = = be NwSi(Ben —-l1)- = [>> NwSn(Ban — 1)*/¢]). 
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There is no loss in efficiency of the variance estimate here if the quantities 
Sa(Ba—1)“* are constant over strata. 

A set of results paralleling those above is obtained when a simple cost func- 
tion is brought into the situation as an additional restriction. Let the total fixed 
cost C= }°eany. Then we have 

CByer'!4 


n* = 


*2 2 


V —_ de l in = Wa 2 
Caen phen (E GAY +4 


4 3/4. 4 
B, (> Bicr ) 
N4“V — V*) = >> — - — 
3 3 
Ni, C 
Some insight into the range of penalty to be expected for using a sample 
allocation that is not optimum for variance may be gained by computing a few 
examples of efficiency of v(j,.). Suppose that an allocation {n,=#% (constant) } 
deviates from {nf} by k sampling units in each stratum, i.e., 


|a—m| =k, -e ZL. 


Although entirely artificial and unlikely in a strict sense this example does not 
depend on the number of strata and permits use of formula (3). Table 1 indicates 
that in this instance serious losses in efficiency of v(¥,,) would occur only for 
rather wide departures from {ny}. This is the usual sort of result in optimum 
allocations, i.e., one often finds a wide range of near-optimum allocations. 


3. TWO-STAGE SAMPLING 
Here we shall assume a model in which n primary sampling units are drawn 
at random from an infinite universe of primary units, each containing an in- 


TABLE 1. EFFICIENCY OF V RELATIVE TO V* FOR 
SELECTED VALUES OF |nj—a| AND 


(Entries in the table are V*/V expressed as per cents) 





3 
> 
| 
= 





ono ur WON 


_ 
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finite number of subunits or elements. Within each selected primary unit ele- 
ments are chosen at random. An observed value y;; on the jth element in the 
ith primary unit is assumed to be of the form 


ys = VY tut wy 


where Y is the grand mean of the y’s and u,; and w,; are independent with zero 
means and variances S2 and S2, respectively. Under this model the variance of 
the sample mean 7=1/nm }-y,; is (see e.g. [1, section 10.2]) 

ae se 2 

VQ =—+—: 
n nm 

A simple unbiased estimate of V(y) can be obtained from the “between units” 
mean square in the analysis of variance given below. 





ANALYSIS OF VARIANCE OF THE SAMPLE 





d.f. Mean square Expected mean square 
i 





Between units n—1 m 8, Sit+m S. 





Within units n(m—1) 8. Ss. 





83 
Then v7) = —- 
n 


Suppose a cost function, C=c;n+cnm, is brought to bear on the sampling 
scheme. Our problem is to find the effeet on v(y) when n and mare varied subject 
to the cost function. In the present situation the variance of the estimated 
variance is a more complicated expression which does not yield a simple solution 
for an optimum choice of m and n. However, the variance expression is reason- 
ably amenable to calculation for specified values of population parameters and 
sample size so that the optimum choice of sample size may be determined 
numerically. 

The variance of v(j) may be written [4] 


ae 1 1 
Vio@)] = a8 M,+ ree Qa+ 


* ee eo 


where 
M,= E(u; -_ Eu,)* and Q, = E(w; -_ Ew3;)*. 


(The variance of the mean square is also obtainable from Tukey’s expressions 
for variances of variance components [8].) By writing 
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M, 
~ ee 
Sy 


Be 


? 


and dropping terms in 1/n the variance reduces to 
(m— 1) 4 


? es = 2 4 22 
V[og)] = — Su(82 — 1) + Sa Selrs — 1) a Se + —— SWSe. 


Eliminating n by the substitution 


we obtain after some manipulation 


4 


cSu (er, _\? S.\' Se 
vo] = =" (2 +m) {2(1+ =) + -3)+@-9f. 


Ignoring the term c3S%/C* in (4), we can compute values of V relatively by 
supplying numerical quantities for c:/c2, S./Su, Bz: and y2, and m. Relative 
variances have been computed for m=1, 2, - - - , 20 for all combinations of the 
parameters given below. 

Ci 


— = 10,5,1 
C2 


= 16,9,4,1 


B: = 4,3,2,1 
¥: = 4,3, 2,1. 


For each combination of these four parameters that value of m was determined 
which yielded the smallest relative V. Call this optimal subsample size m*. 
For estimating the mean an allocation given by 
1 
m’ = — 4/— 


« C2 


is known to be optimum under the assumptions made here [5, 6]. The efficiency 
of v(j) under the m’ allocation versus the m* allocation has been tabulated for 
the combinations of parameters listed above. The results are given in Table 2. 
Within the range of the computations a few general tendencies stand out. 


a) Efficiency of »(J) increases as the ratio S%,/S? decreases, for 6222. 

b) Efficiency of v(f) increases as y: increases and as 62 decreases, except 
for S2,/S2=1. 

c) For high cost ratio (c:/c:= 10) the efficiency of v(f) is fairly insensitive to 
the variances and coefficients of kurtosis. 





TABLE 2. EFFICIENCY OF o(j7) WITH SAMPLES OF m’ RELA- 
TIVE TO SAMPLES OF m* FOR SELECTED VALUES OF 
¢:/¢2, S2/S2, B2 AND v2 


(Values of m* are given in parenthesis) 








¢:/cz =10, S2/S2=16, m' =13 


€:/c2 =10, S2/S2 =9, m’ =9 





1 2 3 


4 


1 2 3 


4 





91(1) 94(9) 95 (10) 
82(1) 90(8) 91 (8) 
74(1) 85(8) 87 (8) 
68 (1) 82(7) 83 (8) 





96 (10) 
92 (9) 
88 (8) 
84 (8) 





99(8) 100 (8) 
95 (6) 96 (7) 
90 (6) 92 (6) 
86 (5) 88 (6) 


100 (9) 
97 (7) 
93 (6) 
90 (6) 


100 (9) 
98 (7) 
95 (7) 
91 (6) 








¢:/¢2=10, S2/Si =4, m' =6 


¢1/¢2 = 10, S2/S2=1, m’ =3 





1 2 3 


4 


1 2 3 





99(7) 98(7) 98(7) 
97 (5) 98(5) 99 (5) 
93 (4) 94(4) 96(5) 
89 (4) 90(4) 92 (4) 


98 (7) 
99 (5) 
96 (5) 
93 (4) 


88 (5) 
100 (3) 
100 (3) 

95 (2) 


90 (5) 
100 (3) 
96 (2) 
92 (2) 


89 (5) 
100 (3) 
98 (2) 
94 (2) 








¢:/c2 = 5, S2/S2 =16, m’ =9 


¢:/ce2 =5, S2/Si=9, m' =7 





Bs 


72 
1 2 3 


4 


1 2 3 





39(1) 87(5) 90 (6) 
87 (1) 83(5) 86 (5) 
35 (1) 80(5) 83 (5) 
33 (1) 77(5) 80 (5) 





92 (6) 
89 (6) 
86 (5) 
83 (5) 


93 (5) 
89 (4) 
85 (4) 
81 (4) 


56 (1) 
51 (1) 
47 (1) 
44 (1) 


91 (4) 


78 (4) 








€:/¢c2=5, S2/S3 =4, m' =4 


¢1/c2 = 5, S2/S2 =1, m’ =2 





72 
1 2 3 


4 


1 2 3 





1) 100 (4) 100(4) 1 
1) 96(3) 99(3) 1 
1) 93(3) 95 (3) 
1) 90(3) 93 (3) 


6 ( 
7 ( 
1 ( 
6 ( 


8 
7 
7 
6 


00 (4) 
00 (4) 
97 (3) 
95 (3) 


94 (3) 
100 (2) 
100 (2) 

94 (1) 


92 (3) 
100 (2) 
100 (2) 
100 (2) 


98 (3) 
97 (1) 
89 (1) 
84 (1) 








¢:/¢: = 1, S2/Si=16, m’ =4 


¢:/¢2 =1, S2/S2 =9, m' =3 





v2 
1 2 


1 2 3 





10 (1) 
10 (1) 
10 (1) 
10 (1) 


47 (1) 


80 (2) 


77 (2) 


19 (1) 82 (2) 
19 (1) 
18 (1) 
18 (1) 


54 (1) 
53 (1) 
51 (1) 


50(1) 76(1) 








¢:/e2 =1, S2/S2=1, m’ =1 





-Y2 


1 2 3 

















100 (1) 
100 (1) 
100 (1 
100 (1 


100 (1) 100 (1) 


100 (1) 100 (1) 








100 (1) 
100 (1) 
100 (1) 
100 (1) 


———$$—— 
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ON STABILIZING THE BINOMIAL AND 
NEGATIVE BINOMIAL VARIANCES 


Nico F. LausscuER 
South African Council for Scientific and Industrial Research 


Transformations for the stabilization of variance of the binomial 
and negative binomial distributions are considered. In each case a new 
transformation is suggested. These transformations are two-term in- 
verse sine and inverse-hyperbolic sine transformations involving six ad- 
justable constants, thus generalizing previous work of Anscombe [1] 
and of Freeman and Tukey [3]. In the sequel some properties of these 
transformations are derived. The nature of the dependence of the vari- 
ances of these transformations on some parameter is worked out and 
depicted graphically. 


1, TRANSFORMING THE BINOMIAL 


NscoMBE [1] showed that if z is a binomial variate with mean m=np and 
pe np(1—p), the transformation 


x + “\" 
n+ C3 


stabilizes the variance of z. This transformation has variance with asymptotic 
expansion 


ii(z) = (n+ ci)*/sin-+( 





(1) 


Var (t,) ~(14+- 
ar Fed am 
7 cn 


al a) 
2n 8m 8(n — m) 


under the condition that m/n remains constant for large values of n. Thus the 
Anscombe transformation 


i, = (n + 4)1/2 sin-! eae 
n+ 3/4 


has an asymptotic variance }+0(n~*) when n—«. Obviously, this is a gen- 
eralization of the well-known Bartlett transformation 


x 1/2 
te = n'/2 sin-! (=) (2) 


n 


which was studied in detail by Eisenhart [2]. 
A further generalization of (2) was suggested by Freeman and Tukey [3] viz. 


ts = (n+ ay sin (—)" ‘wakes (: a yh. 
=" n+1i/) J} 


In what follows we shall prove that the variance of this transformation has an 


asymptotic expansion 
1 
—- — —- ——_ + O(n) _ when n—>~, 
8m = 8(n — m) ig 
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Pursuing their idea we now consider the transformation {(s) =u+v, where 


ue 8+ m,\'/? 
u=m, sin 


m3 


nj? , 8 + mz \'/? 
v=m, sin 


ms 
and where 


s=Z2—mM, m=n+c,, mi =n+cl, Mme=m+c2o, me =mM+c{, Ms=Nn+C;3 
and m3=n+c3. Evidently, 


- 1/2 1 s 
) = =f (mz + 8)~!/?(m3 — me — 8)~*/"ds 
7 


where k is some arbitrary constant. Expanding the integrand in a power series 
in s and integrating, we find that 


: 8 ob m2 1/2 « 
sin = > asi 
t) 


ms 


ms \1!2 
@ = sin~!{ — 
m3 


a, = $mz'!/2(m, — m,)—!/? 


where 


a2 = }(2m, — m3)mz*/2(mz — mz)—*/? ete. 


sta\'* =. 
an~* | — = > afs*, 
0 


mi 
then it may be derived that 
«e oo 
Cov (u,v) = (mym{)'/? > > a G0; (Hi4; — mints) 
im jm 


where yu, = E(s*), (¢=0, 1, 2, - - - ). Since we; and we}; are of O(n*) when n> ~~, 
(¢=1, 2, 3, +--+), and also since a; and a/ are of O(n~*) when n-~, (i=1, 
2,3, ---), it is evident that 


Cov (u,v) = (mym{)"/?[ayaf ws + (azaf + aad )ps 


+ (asaf + ayaz) us + arad (us — u)] + O(n-*) when n— ~, 


By using the moments of the binomial distribution and the values of a; and a/, 
we find 


Cov (u, v) 
l E gnats - bt = Ont) | ot Sew) Ee) : 


wae On 8m 8(n — m) 
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By using this expression together with . xpressions derived from (1) in the well- 
known formula for the variance of the sum of two dependent variables, we find 
ate —-1l 3—4e+c¢c7) 3+4@+ 7%) —4(c3 +03) 


V ~1 . 
ete) * 2n * 8m * 8(n — m) 





From this we find that if 


xz 1/2 
ty = n'/2 sin! (=) + (n + 1)!/? sin ( 


n 


z+ a 
n+3/2) ’ 
then Var (t,) = 1+0O(n~), when n—. Obviously, the result on the asymptotic 


expansion of the variance of the Freeman-Tukey transformation—which was 
mentioned earlier—follows by proper choice of c’s. 


2. NT'MERICAL INVESTIGATION 


Tables 1 to 3 show, for n= 10, 30 and 50, respectively, the values to four deci- 
mal places of Var (¢;) and Var (¢,) corresponding to different values of p. Bi- 
nomial probabilities were generated by means of the iteration formula: 


Po = (1 — p)’, 
n=-ztile»p 


n 
Pe= ("pra = pyr = Pes a ie z= 1,2,3,---,n. 





From Table 3 it is seen that & is a very efficient transformation for the 
stabilization of the binomial variance for large values of n. 

Fig. 1 shows the nature of dependence of the variances of f, ta, t; and ¢, on p 
when n= 30. 

Var (t:) was tabulated by Anscombe [1] for n=10 and 0<p<0.4. Var (ts) 
was tabulated by Eisenhart [2] for n=10, 20, and 30 and for 0<p<1. We 
have also calculated Var (¢;) and Var (t:) for n=10, 20, 30, 50. Since the 
functions Var (¢) are all symmetric about p=} we consider the case 0< p<} 
only. If any assertion concerning Var (¢) holds in the interval 0<pi<p<p2<}, 
then it also holds in the interval }<1—p2<p<1—p:<1. From Fig. 1 it is 
clear that the Anscombe transformation is superior to the others. There exist, 
however, certain domains of p where better transformations may be obtained. 
For instance when n= 10 and p=0.09 the transformation to be employed is &. 


3. TRANSFORMING THE NEGATIVE BINOMIAL 


Let x be a negative binomial stochastic variable with mean value m and 
exponent k, (m, k>0). It is well-known that the variance of the distribution of 
z is equal to m(1+m/k). 

A transformation for stabilizing the variance as suggested by Anscombe 
[1] is 


T, = (k- 4) sinh ( 


t+ 3/8\'2 
k- asl 


Anscombe allocates an asymptotic variance 4+O(m-*) to it under the condi- 
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TABLE 1. RELATIONSHIP BETWEEN p AND VAR (é:), VAR (ts) 


Var (3) Var (ts) 








0.4168 
.6873 
.8570 
. 9584 
.0413 
.0478 
.0296 
.0084 
-9917 
.9807 
.9747 
.9729 


.4533 
.7457 
.9276 
.0347 
-1181 
.1188 
-0937 
.0663 
-0449 
-0307 
.0229 


— eet et et et OO CO 











TABLE : 
n =30 








Var (ts) Var (t.) 


0.8433 
1.0379 
1.0526 
1.0304 
.9955 
9846 
.9829 
.9834 
.9841 
.9846 
.9850 
9851 


.8985 
.0986 
. 1063 
.0756 
.0283 
.0105 
.00438 
.0029 
-0020 
-0016 
0074 
.0014 








pt eee ttt et et 








TABLE : 


n=50 





Var (3) 





1.0041 
1.0491 
.0135 
.9940 
9858 
9867 
-9881 
.9891 
.9898 
.9903 
-9905 
-9906 


_— 
— 
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STABILIZING THE BINOMIAL VARIANCE 
(n= 30) 


Fia. 1 

















tion that m/k is constant when m— ~~. This transformation can evidently be 
obtained by considering the general transformation 


z + C2 1/2 
t= (k 1/2 sinh? 
errr G + <3, 
for which we find an asymptotic variance 
4 Var (t) = 1 + (2c; + 1)/2k + (3 — 8c2)/8m 


— [3 + 8(c2 + ¢s)]/8(m + k) + O(m-*), 
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when m— and m/k=constant. 
As before, we consider the transformation 


t=u+ov 
where 


z+ c2\1/2 
= (k 1/2 sinh! 
u = (k + ¢,)'/* sin (=*) 


, 1/2 
v = (k+ cz)? sinh-( ) 


and in a similar way as in reference [1] we find the asymptotic variance of this 
transformation: 


1+ (c, + ef + 1)/2k + [3 — 4(c2 + cf) ]/8m 
— [3 + 4(c2 + 3 + cf + cf)]/8(m + k) + O(m-). 
By choosing ci =c{ =cj =0 and c,= —1, c2=3/4, cs= —3/2 we obtain the 
transformation 
T; = k*/? sinh! (2/k)'/? + (k — 1)*/? sinh“ (—)" 
k — 3/2 
defined for k >3/2. This transformation has an asymptotic variance 1+O(m7*). 


4. NUMERICAL INVESTIGATION 


Tables 4 and 5 show, for k=5 and k=10, the values to four decimal places 
of Var (7) and Var (T:) as functions of m. We have generated the negative 
binomial probability function 


r k z 
P,= : - ( = ) (1 + m/k)-*, (x = 0, l, ee 
zIr(k) \m+k 





by means of the iteration formula 

Po = (1 + m/k)* 

P, = P,ym(xz +k — 1)/z(m+ k), (ef = 1,2,3,---). 
Let us consider the transformation 7’; to illustrate the issue. 


, 3/8\1/2 Ed « 2 
Var sinh (—) | = >a.- ( rs.) , where 


k — 3/4 0 


x + 3/8\'? z+ 3/8\'” 
S, = P, sinh“ ( ) and Q, = S,sinh (= *) ‘ 
¥ M 2 k = 3/4 


Now, by considering )-S,, it turns out that the error involved after summing 
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TABLE 4. RELATIONSHIP BETWEEN m AND VAR(T)), VAR(T:) AS A 
PROPORTION OF THE LIMITING VARIANCE 


k 




























Var(T;) Var(T:) 

1 0.6367 0.9027 
2 0.8448 1.0563 
3 0.9253 1.0703 
4 0.9605 1.0582 
5 0.9773 1.0439 
6 0.9858 1.0321 
7 0.9904 1.0232 
8 0.9930 1.0165 
9 0.9944 1.0116 
10 0.9953 1.0079 
20 0.9964 0.9959 

















TABLE 































k=10 
m Var(7':) Var(T:) 
1 0.6768 0.9478 
2 0.8863 1.0864 
3 0.9579 1.0813 
4 0.9842 1.0581 
5 0.9943 1.0390 
6 0.9983 1.0259 
7 0.9998 1.0173 
8 1.0003 1.0117 
9 1.0004 1.0081 
10 1.0003 1.0056 
20 0.9995 0.9996 











the first N terms, (N large), is approximately equal* to Sy/Ln(1+k/m). 
Similarly, the error involved in )°Q, after summing to N terms is 
Qyv/In(1+k/m). Hence, the error in the variance is approximately equal to 


=(" o? +Q ) 
k oe N , 


for large values of m. In the case k=5, m=20 we find that the error in the 
variance, after summing to 80 terms, is equal to 2.10-*. To make sure of our 
results these series were calculated up to z= 200. 

Fig. 2 shows the relationship between the variances of these transformations 
when k= 10. It is evident that 7: is a good transformation down to m=1. 

The calculations were done on the “ZEBRA” digital computer of the Na- 
tional Physical Research Laboratory. 


* This result was pointed out to us by Alewyn P. Burger. 
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ON MATCHING LISTS BY SAMPLES 


Des Ras* 
National Statistical Service of Greece 

















The problem considered is estimation by sampling methods of the 
number of names common to two or more lists. The situation covered 
is that in which the lists are found to have been merged and must be 
sampled as one. A comparison is made with the case in which the lists 
can be sampled separately. Tables are included giving the sample sizes 
necessary to achieve a specified level of precision. 


1, INTRODUCTION 

















HE problem of estimating by sampling methods the number of names 
gyre t ose to two or more long lists of names has been considered previously 
by Goodman [1] and Deming and Glasser [2]. A simple random sample of a 
predetermined size is taken from each list and the estimate is based on the 
observed number of names common to the samples. There may be situations, 
however, where the lists are maintained at a central place in one register and 
it is very costly to separate them out before selecting the sample. In such a 
case a sample is selected from the entire register and the sample names are 
allocated to the relevant lists. This note presents some theory covering this 
situation. A comparison is made with the other case in which samples are se- 
lected separately from each list and a few results are given regarding the latter 
situation. 


2, LISTS SAMPLED SEPARATELY 












Let there be two lists containing N and M names and D be the number of 
names common to them. Samples of sizes n and m are selected at random with- 
out replacement from the two lists and d names are observed to be common to 
the two samples. As an unbiased estimate of D we have from Deming and 
Glasser [2, p. 411] 

D= = d. (1) 


nm 






The variance of this estimate is given by 


1 pf + (n — 1)(m — 1) 


V(D .. 
asad (N — 1)(M — 1) 


nm 








(D - »| _ D? (2) 


and an unbiased estimate of this variance is provided by 












$b) = of = — 1)(M — 1) 7 | 
(n — 1)(m — 1) 
Pe ee ee. m (N—1)(M —1) 
+ B{1 n—-lm—1 NM | @) 










* United Nations Sampling Consultant in Greece. 
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The total cost of the investigation may be given by 
C = co (n +m) + cynm (4) 


where ¢ is the cost of selecting a name from a list and preparing it for compari- 
son and ¢; represents the cost of comparing a name in one sample with a name 
in the other sample. Usually c; will be quite small as compared to co. In order 
to determine the best sample sizes n and m, which minimize V(D) for a given 
cost C, we substitute 


n = (C — cym)/(co + cm) (5) 


in the expression for the variance of D. We have 


“ V(D) = NMD(co + keo — kC)(cxm? + 2cum — C)/[m2(C — com)*] (6) 
™m 


where 
k = (D — 1)/[(N — 1)(M — 1)]. (7) 
This shows that the variance has a stationary value when 
cym? + 2ccem — C=0, or n=m. (8) 


The second derivative evaluated at this point is 


2 


7] 4 
— V(D) = 2NMD(co + keo — kC) (co + cym)/[m*(C — com)?] 


om 
which shows that the variance is minimum for n= if 
Co + keg — kC > 0. (10) 


Thus it is best to sample an equal number of names from each list, provided D 
is not very large. The exact condition is 


(N — 1)(M = Ieo 


D<1+- il 
— (11) 





This condition is not satisfied when, for example, 
N = M = 10,000, ¢, = 0.1, co = 5, C = 110,000, D = 6,000. 


3. LISTS SAMPLED AS ONE 


Let us assume that the two lists are merged into one with R names. A simple 
random sample of r names is selected without replacement and the names in 
the sample classified as those belonging to the first or to the second list. Let d 
be the observed number of names common to the two lists. A random variable 
5,(j=1, 2,---, R—D) associated with the jth distinct name is defined such 
that 


5;=1 if the name occurs in both sample lists, 
=0 otherwise. 
It is easily noted then that 





ON MATCHING LISTS BY SAMPLES 


R 2 

P= 1)=(5)/(5) = 8 - Be, 
| r R a 
ze) =(7)/() for] # k, 


d= > 4, 


rn = 0(°)/(2) 
ni ~ (5) /(2) +000 0(5)/(8) 


Hence an unbiased estimate of D is provided by 
R(R-1 
( ) 4 
r(r — 1) 


is 


with a variance sven by 
V(b) = oft ois oe aoe |- D*. 
r(r — 1) (R — 2)(R — 3) 
As an unbiased estimate of the variance of D we take 
r(r—1) (R — 2)(R — 3) 
~ R(R— 1) eee 


+efaBe=8.. 0} 
(r — 2)(r — 3) 








0(D) = p+/1 





The extension to & lists is immediate. 


4. COMPARISON OF THE TWO SITUATIONS 


It is of interest to compare the variances in the two situations, assuming that 
it is equally feasible to sample the two lists separately or as one. The comparison 
will be restricted to the case where the two lists are of the same size W. A sample 
of size u is selected from either list in the first case (assuming that this is the 
best thing to do) and the sample size is 2u in the second case. We have 


V(b) = Din gt in. »| ~ D (20) 
AOR. cos (W — 1)? 





i 5 W(2W — 1) pit (u — 1)(2u — 3) 


V2(D) = (D — »| - D*, (21) 


u(2u — 1) (W — 1)(2W — 3) 
The difference of the two variances is given by 
Ww(w —- — 1)(2 2W -3 
V;-V - fi - )Qu + ) 
u*(2u — 1) (W — 1)*(2W — 3) 





(D - |. (22) 


Using inequality (11), we have 
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D-1<— 


Hence, 


V2—Vi> 


W(W — u) 


(W — 1)%eo 


= (W — 1)? 


(W— 
< = 


1)? 





C1 
(2Qu—-1)+— wu? 
Co 


2u — 


(2W — 2u — 1) 





u?(2u — 1) 


, >0 
(Qu — 1)(2W — 3) 


1 


(24) 


This proves that from the standpoint of variance it is better to sample the 
two lists separately. The relative increase in variance is, however, negligibly 
small. Tables 1 and 2 given in the next section substantiate this point. 


5. SAMPLE SIZES FOR SPECIFIED PRECISION 


We will now present Tables 1 and 2 giving the total sample sizes required in 
order to achieve coefficients of variation of 10 per cent and 5 per cent for 
estimating the number of names common to two lists. The following situations 


are covered. 


(i) 
(ii) 


size being the same as in (i). 


(iii) 


shorter than the second list in (ii). 


The two lists are of equal size with N = M=15,000. 
The two lists are of unequal size with N = 10,000, M = 20,000, the total 


One list has the same size as in (ii), namely 10,000, but the other list is 


It is of interest to note how the sample sizes vary in different situations. 


TABLE 1. SAMPLE SIZE REQUIRED FOR A COEFFICIENT 


OF VARIATION OF 10 PER CENT 





Number of com- 
mon names 
D 


Lists sampled as 
one, total size: 
30,000 


Lists of equal 
size sampled 
separately 
N =15,000 
M =15,000 


Lists of unequal 
size sampled 
separately 
N =10,000 
M =20,000 


Lists of 
equal size sam- 
pled separately 

N =10,000 
M =10,000 





(1) 


(2) 


(3) 


(4) 


(5) 








1,000 
1,500 
2,000 
2,500 
3,000 
4,500 
6,000 
7,500 
9 ,000 
10 ,000 
10,500 

12,000 
13 ,500 

15,000 








4,459 
3,680 
3,199 
2,865 
2,615 
2,129 
1,836 
1,635 
1,486 
1,399 
1,370 
1,276 
1,198 
1,132 





2,952 
2,430 
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TABLE 2. SAMPLE SIZE REQUIRED FOR A COEFFICIENT OF 
VARIATION OF 5 PER CENT 








Number of com- 
mon names 
D 


Lists sampled as 
one, total size: 
30,000 


Lists of equal 
size sampled 
separately 
N =15,000 
M =15,000 


Lists of unequal 
size sampled 
separately 
N =10,000 
M =20,000 


Lists of 
equal size sam- 
pled separately 

N =10,000 
M =10,000 





(2) 


(4) 


(5) 





sss 


nowon 
S88 


NO kr Wd Ne 





7,884 
6,710 
5,926 
5,354 
4,913 
4,025 
3,473 
3,087 
2,798 
2,642 
2,572 
2,388 
2,235 
2,106 





~I 
= 
S 


NOwWwnwworrooug s| 


wSanonwor OS O&O 


eS) 
= 





w 
Oo Ot 


ao bh 


Or ora em sI 
S2yenoonw 
€ > Oto & ® bo 


bo oo 
rs 


nt oO 
oO © 


bo bo 
~~ 
~] * 
eo 








REFERENCES 


{1] Goodman, Leo. A., “On the analysis of samples from k lists,” Annals of Mathematical 
Statistics, 23 (1952), 632-4. 
[2] Deming, W. E. and Glasser, G. J., “On the problem of matching lists by samples,” 
Journal of the American Statistical Association, 54 (1959), 403-15. 








PROBABILITY TABLE FOR NUMBER OF RUNS OF SIGNS 
OF FIRST DIFFERENCES IN ORDERED SERIES* 


Evaene 8. EpaGineton 
Kansas State Teachers College 


This probability table is based on the number of runs of signs of first 
differences in ordered series, or expressed differently, the number of 
rises and falls in ordered series, that would occur under this nonpara- 
metric null hypothesis: the relative magnitude of an observation is in- 
dependent of its position in the series. The runs test used in conjunction 
with the probability table in this article is probably less sensitive than 
rank order correlation techniques for detecting monotonic correlations, 
but much better for rejecting the above null hypothesis in dealing with 
series involving cycles or some other non-monotonic systematic rise and 
fall in the magnitude of the observations, such as U-shaped or inverted 
U-shaped series of observations. 


1. DEFINITIONS 


trsT differences in ordered series can be obtained by subtracting the first 
| pore in the series from the second, the second from the third, and so on, 
each value being subtracted from the next in the series, with each subtraction 
yielding a first difference. This provides a series of first differences. Discarding 
the numerical value of the first difference in every case and retaining only the 
direction (algebraic sign) of the difference results in a series of signs of first dif- 


ferences. 

A run of signs of first differences refers to a group of one or more identical 
plus or minus signs in the series of signs of first differences. Any two adjacent 
runs must be runs of different signs. 

An example will help clarify the above definitions. Consider the following hy- 
pothetical data, showing reaction time as a function of age. Values of the inde- 
pendent variable are arranged in their natural order, with the corresponding 
values of reaction time beneath them. 





Age (years): 14 15 16 17 18 19 20 
Reaction time (1/100 sec.): 60 57 54 53 56 57 57 
First differences: : : : +1 

Signs of first differences: 





The O first difference between 57 and 57 has no sign, so it is not included in the 
series of signs of first differences. This elimination amounts to excluding one of 
the tied observations from consideration for the runs test. Such an adjustment 





* Andre's recursion formula used to determine the distribution of runs for constructing this probability table, 
and the derivation of the normal approximation formulas for the mean and standard deviation of the distribution 
of runs are given in this reference: Wallis, W. Allen and Moore, Geoffrey H., “A Significance Test for Time Series 
and Other Ordered Observations,” Technical Paper 1, September, 1941, National Bureau of Economic Research, 
New York, New York. On page 35 they presented a probability table designed to serve a purpose similar to the 
purpose of this one. Table 1 has these advantages over the table Wallis and Moore published: (1) it can be used 
for either one-tailed or two-tailed tests, instead of being restricted to two-tailed tests; (2) it can be used for as many 
as 25 observations, instead of 12 observations; (3) it is expressed in terms of runs, instead of turning points (changes 
in signs of first differences), making it more consistent with other tests. 
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needs to be made only when there is a O difference; i.e. when the tied observa- 
tions are adjacent. There are four runs of signs of first differences: a run of three 
minuses, followed by a run of four pluses, then a run of one minus, and finally 
a run of two pluses. 


2. TYPE OF STATISTICAL PROBLEM FOR WHICH TABLE 1 IS APPROPRIATE 


Table 1 is based on the number of runs of signs of first differences in ordered 
series that would occur under this nonparametric null hypothesis: the relative 
magnitude of an observation is independent of its position in the series. In other 
words, Table 1 is based on the number of runs to be expected under “random 
fluctuation” of the observations. A quantitative estimate of the power of tests 
utilizing the number of runs of signs of first differences for rejecting the above 
null hypothesis is not available. The runs test employed below under the head- 
ing “How to use Table 1” probably would be less sensitive than rank order cor- 
relation techniques in dealing with situations where the magnitude of an ob- 
servation is a monotonic function of its rank position in the series, but much 
better for rejecting the null hypothesis stated above in dealing with a series in- 
volving cycles or some other non-monotonic systematic rise and fall in the mag- 
nitude of the observations, such as a U-shaped or inverted U-shaped series of 
observations. ; 

The usefulness of Table 1 in dealing with cyclic phenomena can be illustrated 
by two examples. Assume that a study of a homeostatic process in animals in- 
volves the expectation of a gradual oscillation of the magnitude of some speci- 
fied effect with the passing of time. The investigator should expect fewer runs 
than ordinarily would occur if the null hypothesis were true. In the second ex- 
ample, assume that a study deals with a process for which it is expected that 
each successive manipulation generally produces an effect opposite to that of 
the previous manipulation, analogous to flipping a light switch. In this case one 
should expect a large number of runs. In the first example, the investigator 
should use Table 1 to determine the probability, under the null hypothesis, of 
so few runs as he obtained; in the second example, the investigator should use 
the table to determine the probability of so many runs as he obtained. 


3. HOW TO USE TABLE 1 


The proportion given in Table 1 at the intersection of o observations and r 
runs is the proportion of the permutations of o different numbers that yield r or 
fewer runs of signs of first differences. 

In the hypothetical data presented above there are four runs. Since one tied 
observation was discarded the number of observations used in computing the 
number of runs is eleven. The probability of four or fewer runs is determined 
by noting the value at the intersection of eleven observations and four runs in 
Table 1, which is .0239. 

The table is set up in a way that allows direct determination of the probabil- 
ity of r or fewer runs. The probability of r or more runs is 1.0000 minus the 
probability of r—1 or fewer runs. Enter the table with o, the number of observa- 
tions used, and r—1 runs (one less run than obtained) and subtract the propor- 
tion at the intersection from 1.0000. 
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The probability of so extreme a deviation from the mean number of runs as 
r runs (two-tailed probability) is the probability of M— | d | runs and fewer plus 
the probability of M+ | d| runs and more, where | d| is the absolute deviation 
of the obtained number of runs from the mean, and M is the mean number of 
runs. The formula for the mean number of runs is (2N —1)/3, where N is the 
number of observations used in determining the number of runs. The values for 
M+|d| and M—|d| may not be integral in some cases, although probabilities 
for numbers of runs have meaning only for integral numbers of runs. The num- 
ber of runs is a discrete variable; there are only integral values, no fractional 
runs. Consequently, one should not interpolate probabilities in the table. For 
example, assume that we have eighteen observations and fourteen runs, and 
our hypothesis suggests a two-tailed test. The mean number of runs for eighteen 
observations is (36—1)/3=11%. The obtained absolute deviation from the 
mean, |d|, is 24. M—|d| =9}. M+ |d| =14. The probability of 9} or fewer 
runs is the same as the probability of 9 or fewer runs; namely, .1006. The prob- 
ability of 14 or more runs is 1.0000 —.8611 (the probability of 13 or fewer runs) 
which is .1389. The probability of such a large deviation from the mean as was 
obtained is .1006+.1389 = .2395. 





4. NORMAL CURVE APPROXIMATION FOR N>25 


The formula for the mean number of runs is (2N —1)/3 and the runs tend to 
be normally distributed about this mean with a standard deviation of 


[= 
9 | 


where N is the number of observations. The significance of the number of runs 
obtained can be determined from normal curve tables in the usual manner. To 
correct for the discontinuity, or discreteness, of number of runs, it is necessary 
to reduce the obtained deviation from the mean by 0.5. 
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Testing Statistical Hypotheses. E. L. Lehmann. New York: John Wiley and Sons, Inc., 
1959. Pp. xiii, 369. $11.00. 


Joun W. Pratt, Harvard University 


LMOST anyone interested in mathematical statistics will find many things he 
A ought to know but doesn’t about even the most familiar statistical problems in 
the course of this impeccable, detailed, mathematically rigorous treatment of “opti- 
mum” properties of significance tests and related confidence procedures. It amply 
fulfills the promise of Lehmann’s famous 1949 lecture notes Theory of Testing Hy- 
potheses, covering more material more completely, but in the same style. In fact, never 
before has the theory of significance tests initiated by Neyman and Pearson been 
so thoroughly explored and so definitely treated. A review of the book therefore seems 
to me to call for a review of the status of the theory. Some serious reservations on this 
score follow a summary and description of the book and a few criticisms of details. 

Chapter 1 introduces various concepts and attempts to broaden the justification of 
the rest of the book. 

Chapter 2 is a lovely bonus. The theorems of measure theory used in probability 
are stated without proofs. Conditional probability and sufficiency are then discussed, 
succinctly but with complete proofs, at exactly the right level of generality for most 
theoretical statistics; the given is a statistic, not a o-field, so that intuitive conditional 
arguments can be carried out immediately; but the treatment is otherwise general, 
so that it covers, for example, the conditional distribution of a continuously dis- 
tributed random variable given its absolute value, or of a sample given the order 
statistics. 

Chapter 3 discusses uniformly most powerful tests, related confidence procedures 
(which are what Lehmann calls “uniformly most accurate”), and the sequential 
probability ratio test. Chapters 4 and 5 discuss uniformly most powerful unbiased 
tests and related confidence procedures. Chapters 3-5 cover all one- and two-sided 
problems in exponential families (in particular the usual one- and two-sample 
binomial, Poisson, and normal problems), as well as some nonparametric problems. 
A valuable inclusion is the two-sided null hypothesis (such as |@| >.1). 

Chapter 6 discusses invariance, its relation to unbiasedness, some examples, and 
some technical problems. 

Chapter 7 discusses linear hypotheses under normality, using invariance mainly. 
The “large-sample” (asymptotic) theory of chi-square and likelihood ratio tests is 
outlined to exhibit the linearity and normality of the asymptotic situations. 

Chapter 8 discusses maximizing the minimum power over the alternatives, its 
connection with invariance, and stringency. 

The 88 examples and 228 problems are seldom routine and present much additional 
interesting material. Each chapter has a list of references annotated to indicate their 
significance for the chapter. An appendix discusses four technical matters, and there 
are subject and author indexes. The erratum-rate is commendably low. 

The summary above scarcely hints at the enormous number of results about both 
parametric and nonparametric problems the book brings together, clarifies, simplifies 
and extends. Its scope is nevertheless carefully delimited. The sample size is fixed and 
“small” practically throughout. There is no point estimation theory whatever, even 
when estimates are at hand (linear hypotheses). There is mention that tw»-sided al- 
ternatives often correspond to three-decision problems, but no three-decision proce- 
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dures are discussed. The Neyman-Pearson formulation is used exclusively. Although 
Lehmann illustrates by example a few of the knotty problems that occur in this 
formulation (see below), he doesn’t depart from mathematics long enough to discuss 
them. 

It is possible to dip into this book for statements of theorems. (I have known a 
first-year graduate student in social science to do so profitably.) The proofs, however, 
are more interdependent. This and the confinement of the subject matter somewhat 
limit the book’s usefulness as a text, but they are justified in that it is certainly a 
beautiful research memoir. 

I have very few criticisms of the exposition. It would have been useful to state 
Fubini’s theorem on double and iterated integrals for positive functions as well as 
integrable ones, since an important method of checking integrability is iterated 
integration of the absolute value. 

It is suggested (p. 213) that a test of circular symmetry of the miss distribution in 
target shooting should be invariant under rotation. But the most important causes of 
error would generally be associated with particular directions, which suggests not 
treating the alternatives symmetrically as regards rotation. 

The theory of invariance, in particular the reduction of a problem through in- 
variance and the importance of compactness of the group, would have been illumi- 
nated by integration over the group. 

In Section 6.10 it is proved that the family of tests associated with an invariant 
confidence procedure is a family of invariant tests. It is as easy to prove more: a 
confidence procedure is invariant if and only if it is associated with an invariant family 
of tests. 

On p. 264 there is the misleading statement that the Wilcoxon signed-rank test 
was proposed by Walsh (Annals of Mathematical Statistics, 20 (1949), 64-81). 
Knowing the signed ranks is equivalent to knowing the signs of all sums of two 
observations, so the general class of tests discussed by Walsh coincides with the class 
of all signed rank tests. I believe that no class of tests singled out by Walsh in the 
paper cited coincides with the class of Wilcoxon tests. Certainly only a few Wilcoxon 
tests coincide with tests actually tabled by Walsh, all at sample sizes under 10 and 
Wilcoxon critical values of 0, 1, or 2. (See also Walsh, J ASA, 54 (1959), 213-24.) 

In Chapter 7 (linear hypotheses), I would have preferred a more thoroughly geo- 
metrical approach—in particular, squared lengths of projections rather than mini- 
mized sums of squares. I find this more transparent, and it clarifies orthogonality (in 
experimental design). 

Equations and theorems are referred to by number but not by page, which is a 
little inconvenient since they are numbered consecutively but unevenly distributed 
through sometimes long chapters. 

These are minor flaws, however. The book achieves its aims and more. It sets 
forth clearly and indeed brings out new beauty in a certain mathematical theory. 
Within this theory, statistical methods of great practical usefulness have been de- 
veloped, and its statements can and frequently do contribute in a vague way to the 
interpretation of data. But this book, by its very excellence, its thoroughness, 
lucidity and precision, intensifies my growing feeling that nevertheless the theory 
is arbitrary, be it however “objective,” and the problems it solves, however precisely 
it may solve them, are not even simplified theoretical counterparts of the real prob- 
lems to which it is applied. 

Tests are usually used either as decision procedures or as inference procedures. On 
p. 60 Lehmann introduces them as decision procedures in the Neyman-Pearson 
formulation. He also says in his preface that investigations from the point of view 
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of Wald’s statistical decision functions essentially confirm the findings of the classical 
theory and that Chapter 1 provides a basis for such a justification. While some cri- 
ticisms I shall make of significance tests apply to some of what is called “decision 
theory,” especially the minimax rule, I remain convinced that, on the contrary, 
some aspects of decision theory constitute an essential advance over the Neyman- 
Pearson formulation, and that they are incompatible except in special cases. For in- 
stance, the significance level a@ plays a central and arbitrary role in the Neyman- 
Pearson formulation, but not in decision theory. Further, a decision theorist testing a 
composite null hypothesis, say #ewo, would be interested in the probability of a Type 
I error, say a(@), as a function of @, not merely in its upper bound a. This means 
that even when a uniformly most powerful test of a composite null hypothesis exists 
at each level a, he would not automatically use one. If, say, the null hypothesis is the 
two-sided one |0| >1, he might prefer a test with a(—1)=.1, a(1)=.05 to a uni- 
formly most powerful test with a(—1)=a(1)=.1. If the null hypothesis is n=0, 
o arbitrary, he might even choose a test with a(c)-—+1 as c— ~, and hence a=1, but 
not the trivial most powerful test at a=1, rejecting the null hypothesis always. Other 
discrepancies between the Neyman-Pearson and decision-theoretic formulations will 
be apparent below. 

On p. 5, confidence intervals for a normal mean are introduced as a purely artificial 
and hypothetical decision problem. It is not clear how even this unsatisfactory ap- 
proach would generalize, what would become of the confidence level, or that the 
findings of “classical” theory would be confirmed. Such difficulties aside, the purpose 
of confidence intervals is for inference, that is, as Lehmann says (p. 4), “providing a 
convenient summary of the data or indicating what information is available concern- 
ing the unknown parameter or distribution.” An equally convenient summary would 
be provided by the midpoint of the interval and its length, or some other one-to-one 
function of endpoints. What has made the confidence interval popular is “indicating 
what information is available.” Decision problems seem beside this point; a con- 
fidence interval probably contains the parameter, and the confidence level measures 
how probably. But does it? By the formal definition, it no longer does, once we insert 
numerical values for the endpoints. Then no probability (except 0 or 1) can be at- 
tached to the event that the interval contains the parameter: either it does or it 
doesn’t. Unfortunately we don’t know which. We think, and would like to say, it 
“probably” does; we can invent something else to say, but nothing else to think. We 
can say to an experimenter, “A method yielding true statements with probability 
.95, when applied to your experiment, yields the statement that your treatment 
effect is between 17 and 29, but no conclusion is possible about how probable it is 
that your treatment effect is between 17 and 29.” The experimenter, who is interested 
not in the method, but in the treatment and this particular confidence interval, 
would get cold comfort from that if he believed it. The confidence-interval theory 
dodges the problem of what information is available about the treatment. 

Tests, too, are often used for purposes of inference. Lehmann says little about this 
except that (p. 62), when there is available a nested family of critica] regions, corre- 
sponding to different significance levels, the critical value [P-value], the smallest 
significance level at which the hypothesis would be rejected for the given observation, 
provides an idea of how strongly the data contradict (or support) the hypothesis. I 
wonder how much of an idea. Prior beliefs about the null hypothesis aside, bare 
significance at the 5% level does not contradict the null hypothesis equally as the 
statistical problem varies or as the sample size varies in a given problem (for instance, 
testing p=.5 against p=.6 with 10 or 1000 binomial observations). Consider also a 
most powerful level 5% test for a simple hypothesis against a simple alternative hav- 
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ing power 99%: if the distributions are continuous, a result which is just significant 
ean hardly be said therefore to favor the alternative, since it is also just significant at 
level 1% when the hypotheses are reversed. In fact, the more powerful the test, the 
more a just significant result favors the null hypothesis. Just how, then, is a test to 
be interpreted as an inference? 

It is possible for an outcome to be significant at one level but not at a less extreme 
level by uniformly most powerful tests (p. 116, Problem 29). This casts doubt on 
significance tests as a general method (for inference or decisions) even in situations 
free of this pathology. 

A relabeling of the possible outcomes which does not affect the outcome actually 
observed surely should not change an inference or decision. However, there are al- 
most always such relabelings which change the P-value and hence may change an in- 
ference or decision based on a significance test. Suppose, for instance, an experiment 
has possible outcomes a, b, - - - , z. Suppose Meter 1 tells the outcome, while Meter 2 
tells only whether the outcome was or was not d. If in fact the outcome is d, you would 
learn this from reading either meter and would want, therefore, to make the same in- 
ference or decision; yet the result of a significance test would ordinarily depend on 
which meter you were reading. A direct continuation of this argument shows an in- 
ference or decision should depend on the probability under the possible hypotheses of 
the outcome observed only (and on this only up to multiplication by a constant). 
The use of the probabilities of other outcomes also, as in the Neyman-Pearson for- 
mulation, inevitably leads to inconsistencies. 

Even if inferences are to be expressed in terms of tests (or P-values) and con- 
fidence intervals, uniformly most powerful tests and uniformly most accurate con- 
fidence intervals may not be appropriate. Suppose, for example, U and V are the 
smallest and largest of a sample of n from the uniform distribution between @ and 
6+-1. The uniformly most powerful level a test of Ho:0 >9 against H,;:0 <0 accepts 
Ho if U > and V >@)+<¢, where c=a"/". It follows that the uniformly most accurate 
level 1—a@ upper confidence bound for @ is min(U, V—c). When V — U >c, @ must be 
below this upper confidence bound, for sure, belying the confidence level 1—a. 
Similarly, letting d be the solution between 0 and 1 of 2d"=a if a<2' and of 2d” 
—(2d—1)"=a otherwise, the confidence interval max(U+d, V) —1<0@<min(U, 
V —d) is uniformly most accurate unbiased and uniformly most accurate invariant 
at level l—a, yet 8 must be the interval if V—-U>d and cannot be in the interval 
(because it is empty) if V —U <2d—1 (which can happen if a >2'*). 

In this problem, more sensible inferences can be made by conditioning on the value 
of V—U=R, say. The distribution of R doesn’t depend on @ and, given R, VU is uni- 
formly distributed between @ and 8+1—R. As Lehmann mentions (p. 7, Example 7), 
if R is near 1, the sample nearly pinpoints @; the smaller R, the less information the 
sample provides. (Yet the length of the uniformly most accurate unbiased and in- 
variant confidence interval described above is increasing in R for 0<R <d and de- 
creasing in R ford <R <1.) 

Thus it appears there are certainly occasions when inferences should be conditional, 
but optimum decision procedures are not. Unfortunately, it is not obvious how to 
shoose what to condition on, and impossible in general to condition on everything 
one might want. Furthermore, this difficulty aside, the unpleasantness that a con- 
fidence interval may include all or no possible values of the parameter, so that the 
confidence level measures no one’s confidence, sometimes cannot be removed, as it 
was above, by a natural conditioning—for instance, in problems concerning the 
ratio of two components of variance (p. 289), the point at which a regression line 
crosses the axis of the independent variable (p. 182), or the ratio of two normal means. 
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Satisfactory criteria have never been given for choosing a “good” inference procedure 
in the Neyman-Pearson formulation. 

I know of no justification, decision-theoretic or otherwise, for unbiased tests as 
such. Lehmann (p. 11) gives an interesting general definition of unbiasedness (equally 
Jacking in inherent justification) and obtains unbiased tests as a special case by tak- 
ing the costs of Type I and Type II errors to be constants b and a. (The significance 
level turns out to be b/(a+5), or 50 per cent if a=b, which seems an unbiased choice.) 

One is driven, finally, to the argument that, for most problems the uniformly most 
powerful unbiased test (if there is one), or a test chosen by some other arbitrary 
criterion, is at least admissible and turns out to have a “good” power curve. How is 
this “goodness” to be judged? We really want a criterion for “better.” Why can’t 
we have one, at least in a theory that is admittedly idealized to make the questions 
and answers precise? Now it is a mathematical fact that the only criteria for “better” 
which are “consistent” (in a pretty inescapable sense) are those which order proce- 
dures according to their Bayes (posterior) risk with respect to some prior distribution 
of the parameter. Lehmann mentions this (p. 14), but doesn’t really come to grips 
with it. If one is being consistent, there is a prior distribution: its existence must be 
reckoned with even if it is regarded as a mathematical artifact. The fact that it is not 
“known” does not eliminate it. A subjectivist feels that the prior distribution means 
something about the state of his mind and that he can discover it by introspection. 
One can reject subjective interpretations, but one cannot escape the force of the con- 
sistency argument just because the parameter is not sampled randomly from a 
population having an objectively known distribution. 

Of course, skillful people can do useful statistics using Neyman and Pearson’s 
formulation. But so can they using Fisher’s, or Jeffreys’, or minimax decision theory, 
or subjective probabilities and Bayes’ Theorem. Neyman and Pearson’s formulation 
has been enormously important and fruitful. For many problems, reasonable-looking, 
practically-feasible procedures have been developed only in this formulation. In fact, 
it is so ingenious and so successful that it is liable to be credited with properties 
which it does not have and which its originators and developers have never claimed 
for it. For the reasons outlined above, it seems to me impossible to interpret signif- 
icance tests and confidence intervals consistently as answers to questions of decision 
or inference using the Neyman-Pearson formulation alone. Such interpretations can 
sometimes be made approximately in particular problems using another formulation; 
but relying on this would make the other formulation the more fundamental one. 

The extra-mathematical difficulties raised in the last part of this review are not 
new. (Extensive and up-to-date references may be found in a paper, “Subjective 
Probability and Statistical Practice,” given by L. J. Savage at a conference in London 
in 1959. The proceedings are soon to be published by Methuen.) But it is under- 
standable that this book with its highly mathematical orientation does not enter into 
such questions. Lehmann has given us a definitive and beautiful mathematical treat- 
ment of an important part of the leading theory of statistics. As such, it will be in- 
valuable to statisticians of all philosophies. 


. Studies in Mathematical Learning Theory. Robert R. Bush and William K. Estes, Editors. 
Stanford, California: Stanford University Press, 1959. Pp. viii, 432. No price listed. 


J. Laurie Snett, Dartmouth College 


A= deal of the current enthusiasm for applying mathematics to the social 
sciences arises from the noteworthy success of the mathematical models for 
learning theory. The book under consideration is a collection of research papers deal- 
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ing with mathematical learning theory, and is an outgrowth of a conference on the 
applications of mathematics in social science research held at Stanford University in 
1957. 

The book is divided into three major parts. The first part deals with the stimulus 
sampling model introduced by W. K. Estes and C. J. Burke. The second part has to 
do with the linear models introduced by R. R. Bush and F. Mosteller. The third part 
deals with extensions and comparisons of these models. 

There is a remarkable completeness to the book in the sense that it deals with 
virtually every aspect of the development of a satisfactory mathematical theory for 
explaining a specific phenomenon. For example, the following aspects occur to the 
reviewer: (a) a specific phenomenon is isolated where consistent experimental results 
are known, (b) a mathematical model is developed and the mathematical founda- 
tions are set forth, (c) mathematical theorems are proved relating the basic descrip- 
tive quantities, (d) statistical estimation techniques are developed, (e) experiments 
are examined in terms of the model, (f) the model is extended and modified, (g) new 
areas of application are found. All of these stages of development are represented in 
the book. 

In Chapter 1, Estes sets forth the foundations of the stimulus sampling model. In 
Chapter 8, Estes and Suppes discuss the mathematical foundations of the linear 
model. The mathematical techniques of finite Markov chain theory are used to de- 
velop theorems relating to the stimulus model and general stochastic process theory 
is applied to the linear model. In Chapter 10, Bush and Sternberg take a simple case 
of the linear model and give a very complete mathematical development of this 
model. Almost every article which deals with experiments has some discussion of 
statistical problems. Particularly interesting attacks on the problem of estimation 
may be found in Chapter 3 by Atkinson and Suppes and in Chapter 13 by Anderson. 
Applications of the stimulus model to experiments in the areas of serial discrimina- 
tion, mediated generalization, vicarious trial-and-error behavior and latency dis- 
tributions may be found in Chapters 4, 5, 6, and 7. In Chapter 14, Galanter and 
Bush discuss applications of the linear model to certain animal experiments. In 
Chapter 2, LaBerge modifies the stimulus model by adding a new type of stimulus 
element in an attempt to explain the phenomena of overlearning. In Chapter 18, 
Bush, Galanter, and Luce discuss an important generalization of the linear model and 
show that this more general model better describes some of the basic experiments. In 
Chapter 19, Suppes extends the linear model to deal with experiments with a con- 
tinuum of responses. New experimental situations are considered by Atkinson and 
Suppes in Chapter 3 and Burke in Chapter 9 where they apply learning models to 
simple two-person interaction situations. With the extension of models and the devel- 
opment of new models comes the responsibility of comparing their respective pre- 
dictive abilities. In Chapter 15 there is an interesting discussion by Bush and Mos- 
teller making a comparison of eight models as they relate to predictions in a specific 
experiment. 

As the above discussion shows, there is a wealth of variety in the articles and yet 
they all deal with different aspects of the same basic type of phenomena and fit 
well together. The reader will find many problems left open in this young field, and 
it is hard to imagine a reader who would not find a question that he would like to 
consider. For example, the mathematician will note that the real mathematical con- 
nections between the stimulus model and the linear model are not given adequate 
treatment. He will also note a certain avoidance of speaking of the limiting distribu- 
tions in the linear model. Can it be that the psychologist is embarrassed by having 
such things as Cantor functions arising from a simple T-maze experiment? The 
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sociologist will no doubt bristle at the intrusion into his subject by the discussion of 
two-person interaction and will be challenged either to prove that it is not true social 
interaction or that, if it is, he can provide the true explanation for what is going on. 
The statistician will be amazed by the wealth of new statistical problems which 
this field has opened up. 

The editors are to be congratulated for bringing together such a fine collection of 
papers. There are, in all, thirteen different contributors, and all papers are on a high 
scientific level. It is clear that the conference which led to these papers was a produc- 
tive one, and from this book the reviewer would guess that it was also an exciting 
conference for the participants. 


An Appraisal of the 1950 Census Income Data. Studies in Income and Wealth, Volume 
Twenty-three. National Bureau of Economic Research, Princeton: Princeton University 
Press, 1958. Pp. x, 450. $10.00. 


Haroup Lypa.u, University of Oxford 


HY do people go to conferences, especially to conferences on income and wealth? 

Of the various possible reasons it is the most obvious one which in this case is the 
most significant. The fact is that “income and wealth” people are passionately in- 
terested in their subject. Only this can explain how it was possible to bring together 
the authors of this volume (and their audience) to churn over at such length the in- 
come data of the 1950 Census. 

Yet the topic is inherently an intriguing one. The United States is one of the few 
countries in the world (there are only about a dozen others) in which questions on 
income have been included in the population census. The practice crept in gradually: 
in 1940 only wages and salaries were asked for in any detail. But in 1950 the ques- 
tions were a little bolder: three to each person, and in principle they covered all types 
of money income. Thus the 1950 Census yields estimates of the distribution of total 
money income, for persons and for families, based on two overlapping samples of 
the population consisting of more than 14 million individuals and more than 7 
million families. What an overwhelming mass of data! 

Up to a point, the purpose of this volume is a very practical one: to consider 
whether (and to what extent) these figures are any use; whether the same sort of 
data should be collected in 1960; and, if so, in what way. But one must not take this 
purpose too literally. The summoners of the conference were not the Bureau of the 
Census but the National Bureau of Economie Research. Behind the scenes—one 
presumes—a much more practical conference (or series of conferences) took place to 
decide what to do about the 1960 Census. For that conference there would have been 
papers showing the costs of collecting and processing the data on income in different 
ways; but for this conference there was none. It is, perhaps, rather remarkable that a 
group of economists should spend several days discussing the value of a product with 
scarcely any mention of its cost. 

As with all collections of conference papers this volume is very uneven. The heart 
of the volume consists of a series of papers dealing with “matching” studies; and 
these are its main justification, for nowhere else—so far as I know—has there been 
such a careful and comprehensive set of tests of the quality of survey material. 
Apart from these, the volume also contains some “introductory” papers (including a 
valuable summary by the editor, George Garvy) and some papers intended to show 
the uses to which census material can be put; but although these are interesting (es- 
pecially those by Selma Goldsmith and Herman Miller) they are only rather loosely 
connected with the central purpose of the book—the appraisal of the 1950 Census 
data. I shall not, therefore, comment on any of these papers in detail. 
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The 1950 Census collected income data from such an enormous sample of people 
that it offered great scope for “matching” studies. There were five such studies alto- 
gether, of which four are reported in this volume. First, the census results were com- 
pared with a sub-sample of the population who were re-interviewed four or five 
months later—in the Post-Enumeration Survey (PES). The PES income sample 
contained nearly 12,000 persons with incomes and 5,500 families. The PES questions 
on income were much more detailed than those asked in the census, the interviewers 
were of better quality and only “best” respondents were interviewed. Hence the PES 
data should be “better.” Unfortunately, however, as they were collected in August- 
September 1950 and the data refer to the calendar year 1949, better interviewing 
techniques may have been largely offset by the larger memory errors. One cannot, 
therefore, take the PES data as the “standard” from which to measure errors in the 
census (as suggested by Pritzker and Sands, p. 208). The most that can be done is 
to see whether the PES technique yields “better” estimates (by which one usually 
means bigger estimates) and to examine the distribution of differences between census 
and PES data for the same units. Pritzker and Sands have dealt adequately with the 
former task but scarcely at all with the latter. They publish two-way frequency 
distributions for males and females but not for families. Regrettably, they give no 
distribution of the differences as such. 

A second “match” of a very similar type was made between the census (taken in 
April) and the Current Population Survey (CPS) which was taken in March. This 
study was not included in the present volume because it had already been published 
elsewhere,' but it logically belongs in this discussion. In this case there were nearly 
5,000 people who reported their income in both surveys, within at most two months 
of each other and in response to very similar questions (somewhat more detailed in 
the CPS). The surprising result was that less than half the persons who reported 
some income in both surveys were in the same income class (classes up to $5,000 
were $500 in width, after which there were three more classes, including an open- 
ended class of $10,000 and over). In spite of this the shapes of two distributions and 
the medians were very similar. Here we have an indication of the minimum amount 
of reporting error likely to be obtained in surveys of this type (in fact, it is even poss!- 
ble that it is less than “minimum” because some people must have remembered the 
answers they gave to the CPS when they were interviewed a month or two later in 
the census). It seems a pity that this study was not more fully analysed and that 
there was not a report of that analysis in this volume. 

A third match was made by re-interviewing about half the units interviewed in the 
1950 Survey of Consumers Finances (SCF) during the PES. Here there was a gap 
of about seven months between interviews, the sample was small and the techniques 
of interviewing were different. There were also great difficulties in making reliable 
“matches” between the families, since the SCF had not recorded the names of 
persons interviewed. An analysis of 619 units which were considered satisfactorily 
matched in other respects showed only 63 per cent in the same income class, despite 
the fact that these income classes were twice as wide as in the previous study ($1,000 
up to $5,000 plus three more classes). Messrs. Sirken, Maynes and Frechtling have 
made a very thorough study of the results of this experiment; but, in the nature of the 
ease, they are unable to throw much light on the quality of the census data as such. 
The most important lessons from this study seem to have been drawn by the SCF, 
which reorganised its technique of sample selection from 1956 onwards. 





1 Herman P, Miller, “An Appraisal of the 1950 Census Income Data,” Journal of the Americaa Statistical Asso- 
ciation, 48 (1953), 28-43. 
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The above three “matches” are all based on comparable surveys and are therefore of 
limited value for the purpose of appraising the census data. But the next two are 
more to the point. First, there was a match between census wage and salary earners 
and OASI wage and salary records; and second a match between census families and 
IRS tax data. This is more exciting. 

For the OASI match 12,000 individuals aged 14 and over were drawn from the 
census and their names were sought in the OASI records. 7,800 were found, but only 
about 2,000 of them were listed in each record with one (and the same) employer. Of 
these only 1,400 were recorded with less than $3,000 in both records (the OASI limit 
at that time). For this group Messrs. Mandel and Wolkstein and Miss Deianey have 
prepared a frequency distribution of the differences between the wages reported in the 
two records. Unfortunately, they do not give the actual figures, nor do they calcu- 
late the variance of the differences; but they do show the distribution in a diagram, 
and it is obviously very close to being Gaussian, with a slight upward bias. This is 
fascinating material, but again inadequately analysed. 

The piéce de résistance is, of course, the comparison with the tax records. For this 
purpose the 12,000 persons aged 14 and over in the income sub-sample of the PES 
were listed and a search was made for their tax returns. There was the usual difficulty 
with making exact matches, enhanced in this case by the fact that many people with 
incomes below the tax exemption limit do not bother to file a return. Complete 
matches were made for 55 per cent of families and unrelated persons and partial 
matches for another 13 per cent. For the completely matched group the overall dis- 
tributions and the medians are remarkably close (although Miller and Paley’s sug- 
gestion that the differences are “not statistically significant” shows a complete mis- 
understanding of what is at stake) and to that extent the results of the test are re- 
assuring. But when we come to the table (p. 198) which shows the two-way classi- 
fication of individual units according to income reported in the census and in the tax 
returns the picture is much more disturbing. Only about 45 per cent of units are to 
be found in the same income class (ranges of $500 from $1,000 to $5,000, plus four 
more classes) and a quite appreciable number are in completely different classes. At 
the extremes, there are some units who reported $10,000 or more on one occasion and 
less than $1,500 on the other. (Can these really be accurate records of what happened, 
or are some of them coding and tabulation errors?) 

In general, one gets the impression that the tax records behave like “just another 
survey.” Yet this can hardly be so. Unless our confidence in official statistics is to be 
very rudely shaken the tax records must be better than that. This important issue 
could have been investigated if only the tabulations from the 1949 Audit Control 
Program had been just a little more detailed. The paper by Marius Farioletti which 
reports some of the results of this study shows that on the average tax returns are 
biased downwards (as one would expect); but the vital comparison of individual re- 
turns in a bivariate distribution is missing. Here is surely a project which should be 
taken up in the 1960 Census. Let a sample of those who have given their incomes in 
the census also be put through the auditing process (naturally, without revealing 
census details to the IRS). Then we shall get as close as humanly possible to finding 
the relation between incomes reported in a survey and “true” incomes, 

As ean be seen, this volume 1s full of meat, and income statisticians will need a good 
deal of time to digest it all properly. What is regrettable is that these valuable ad- 
ditions to our knowledge have not been as thoroughly analysed as they might have 
been. Even this group of papers was not presented until 1956 (six years after the 
census), nor published until 1958; and your reviewer is writing about them in 1960! 
All things considered we ought to be able to do better than that. 
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Individual Choice Behaviors A Theoretical Analysis. R. Duncan Luce. New York: John 
Wiley and Sons, 1959. Pp. xii, 153. $5.95. 


Patrick Supres, Stanford University 


N THE midst of several Social Science Research Council activities sponsored by the 

Committee on the Mathematical Training of Social Scientists at Stanford Uni- 
versity in the summer of 1957 there circulated a long mimeographed report by Dun- 
can Luce which we called the “red menace” because of the color of its cover and 
because of the many lively discussions of its contents. That report has been expanded 
into the present book, and I recommend it to all who are interested in the theory of 
choice behavior. 

The book consists of five chapters and four appendices. Chapter 1 sets forth the 
basic theory. The heart of this chapter is Axiom 1 introduced on p. 6. The first part 
of the axiom states that if R is a subset of S, and S is a subset of 7’, then when a re- 
sponding organism is faced with the choice of one of the al-ernatives in the set 7’, the 
probability he will select one from the subset R is equal to the probability he will select 
one from F given a choice from S times the probability he will select one from S given 
a choice from 7’. In more perspicuous mathematical language, Pr(R) = Ps(R)Pr(S). 
The reader will note at once that this equation holds for conditional probabilities, 
but it is important to emphasize that this part of the basic axiom is not a simple truth 
about conditional probabilities, for the actual choice set (S or 7’) changes, not simply 
knowledge about the subset from which the choice was made. I like to think of the 
matter this way. Suppose we have an experiment in which subjects respond by push- 
ing one of three keys. In any theory of choice behavior we want to be able to predict 
the conditional probability of choosing one of the keys, given that the choice was 
made between two specified keys. This prediction simply deals with one aspect of 
the three-key experiment. Luce’s axiom is concerned with quite another matter. 
Essentially, his axiom may be used to make predictions about a second experiment, 
given results on the first. In this second experiment we block one of the three keys and 
force the response to be one of the remaining two. Some axiom of behavior is required 
to predict behavior in this new situation. 

The second part of Axiom | also bears on this experiment. It simply says that the 
probability of selecting an alternative from any subset S given T' remains constant if 
we delete from T an alternative z which is dominated in preference by some y in T. 
(In the formal statement of this part of the axiom on p. 6 there is an error in the posi- 
tion of the quantifier binding “z,” but the interpretation following makes the intended 
meaning clear.) As is apparent, the second part of the axiom is particularly apposite 
to the sequence of two experiments just described. 

Axiom 1 comes very close to expressing in a new form a much discussed principle 
of the last decade, namely, the principle of the independence of irrelevant alterna- 
tives. Essential use of this principle was made by Arrow in the proof of his impossibil- 
ity theorem for social decision functions. Luce’s version mainly differs from Arrow’s 
in being probabilistic in formulation, but the more important thing is that he has 
exploited its consequences in new and surprising directions, which are marked out 
in the remaining four chapters. 

Chapter 2 is concerned with applications to psychophysics. A gloss of the six cases 
considered is not easily given. They range from Fechner’s law to signal detection and 
rank orderings. The simple yet highly ingenious applications of the basic axioms in 
this chapter depend on the construction of a scale v(x) for alternatives defined in the 
following way: 

P(x) =v(z)/ Dv(y). (1) 


yeS 
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This v-seale, which is unique up to a similarity transformation, is the main tool used 
to attack the problems in all the remaining chapters, not just in Chapter 2. It is ap- 
parent from (1) precisely how the basic axiom enters, namely, because it is postulated 
that choice probabilities for different sets of alternatives satisfy the principle of in- 
dependence of irrelevant alternatives, v(z) may be consistently defined independently 
of S. Luce’s generalization of v to a function of two variables in order to treat inter- 
action of psychophysical continua is particularly elegant, all the more so because of 
the relative neglect of this topic in the psychophysical literature. 

Chapter 3 deals with applications to utility theory. Additional axioms are in- 
troduced in this chapter; by and large they are similar in spirit to those of von Neu- 
mann and Morgenstern, but there are important differences. Luce is not sympathetic 
to the expected utility hypothesis, and he feels that an over-emphasis on this hy- 
pothesis has had a negative effect on descriptive psychological theories of choice be- 
havior. The difficulty with his alternative approach in terms of his concept of a de- 
composable preference structure is that the results are too specific not to be falsified 
by much actual choice behavior. For example, Theorem 12 asserts that if pure alter- 
natives are not perfectly discriminated, then there are exactly three equivalence 
classes of mixed alternatives, i.e., of alternatives that are probability mixtures of 
pure ones. 

Chapter 4 considers applications to learning. In this case the problem is to sup- 
plement Axiom 1, which is static in character, with additional dynamic postulates 
concerning changes in choice behavior over trials. These postulates are stated in 
terms of transformations of the v-seale defined above. Axioms for what are called the 
alpha, beta and gamma models are given. The alpha model is linear in the transforma- 
tions of response probabilities from trial to trial; the beta model is linear in the trans- 
formations of the v-scale but not of response probabilities. The gamma model prima- 
rily differs from the beta model in having imposed on it a boundedness condition for the 
v-scale. The alpha model is not new, being the linear response model much studied in 
the past few years. Most of the chapter is actually devoted to properties of the beta 
model, which originates with the author. It is premature to make an evaluation of 
the empirical adequacy of the beta model, but it can be reported that it is, unfor- 
tunately, difficult to work with mathematically. Moreover, it is the reviewer’s own 
experience that in spite of its apparent attractiveness and simplicity, formulation of 
the model in terms of linear transformations on the v-scale is of no real help. The sec- 
tion on asymptotic properties of the beta model is rather incomplete, and progress 
since the book appeared on determining asymptotic and other properties has not 
essentially depended on use of the v-scale. 

Chapter 5 consists of a brief summary and conclusion. The first appendix gives 
alternative forms of Axiom 1, the second determines the form of the latency distribu- 
tion for a certain class of choice situations by using a continuous analogue of Axiom 1. 
The third appendix derives maximum likelihood equations for the two-alternative, 
two-outcome beta learning model; the results here are mainly due to Robert R. Bush. 
The fourth appendix lists a number of open problems, some of which are conceptual 
and empirical, and some of which are mathematical. I particularly liked the idea 
of this appendix. It is especially desirable in a rapidly developing subject to have a 
current summary of open problems. 

I conclude with two general remarks. Luce states at the beginning that his theory 
of choice behavior is in'a sense orthogonal to stimulus-response psychology because 
the basic assumptions are about relationships between choice sets and not connec- 
tions between stimuli and responses. He goes on to say (p. 2), “Such an approach 
seems to merit careful consideration, since several decades of such S-R psychology 
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have not resulted in notably simple laws of behavior.” My objection to this view is 
that Luce’s own postulates do not provide a schema of the elementary processes 
which determine behavior. It seems much more pressing to ask at once why does his 
Axiom 1 hold (when it does) rather than to ask why in the nature of things stimuli 
become conditioned to responses. The reason it is more pressing is that S-R theory 
represents scientific conceptualization at a deeper level. One has the feeling that the 
schematized processes of stimulus sampling and conditioning correspond in a rough 
way to the dynamic physical processes of an organism in a choice situation; in con- 
trast Luce’s own theory rides on the surface of observable phenomena and postulates 
no underlying processes. I do not mean to suggest that Luce’s own work is not im- 
portant. It is just that I do not think it is really an alternative to S-R theory. To 
emphasize this point, I mention that a simple derivation can be given of Axiom 1 
from the fundamental postulates of S-R theory as formulated by Estes and me if the 
additional postulate is added that the conditioning parameter associated with a 
possible response is independent of the particular subset of possible responses avail- 
able for choice. Admittedly this postulate is also very much in the spirit of the general 
principle of independence of irrelevant alternatives, but unlike Axiom 1 it does not 
involve simultaneous consideration of several subsets at once, but is concerned pri- 
marily with S-R connections. 

My second remark is that a more detailed statistical treatment of the empirical 
adequacy of the theory would have been desirable. The data that are cited too often 
provide suggestive rather than anything like definitive evidence in favor of the 
theory. The coverage of the book is qualified in the title by the phrase “theoretical 
analysis,” but it is a happy fact that in experimental psychology it is hard to get a 
hearing for a theory without some kind of supporting data, a situation which too 
often does not obtain in several branches of the behavioral sciences. This criticism is 
meant to be a mild one, for I know that Luce and his associates at the University of 
Pennsylvania are now hard at work on experimental tests of the theory on several 
fronts and are also struggling with the difficult statistical problems which arise in 
testing many of the applications mentioned above. There is little doubt that the 
present book is an important contribution to the increasingly large literature on 
choice behavior. 


Wages and Earnings in the United States 1860-1890. Clarence D. Long. National Bureau 
of Economic Research Number 67, General Series. Princeton: Princeton University 
Press, 1960. Pp. xviii, 169. $4.00. 


Seymour L. Wotrsern, U.S. Department of Labor 


N THIS brief volume, Clarence Long presents a detailed exposition of both the 

methodology and the results of his painstaking investigation into the course of 
money and real wages and earnings during a most volatile period in American eco- 
nomic history. This is not a mere exercise in the discovery and splicing of various 
relevant statistical series. It is a craftsmanlike job which involved exploring the 
anatomy of the available data, performing many recalculations based on the pri- 
mary sources, so that one obtains also a critical review and evaluation of prior efforts 
in this field. 

Long finds that manufacturing wages rose by about 50 per cent between 1860 and 
1890. Since the cost of living (measured, incidentally, with a notable assist from 
Ethel Bieter of the Bureau of Labor Statistics who constructed a new series for the 
period 1860-1880) was back to about its pre-Civil War level in 1890 after rising 
sharply during the War itself, real wages were also up by about 50 per cent during 





BOOK REVIEWS 175 


this period. Other matters of substantive interest include the finding that there 
was apparently no narrowing (in fact, some evidence of widening) of skill differentials 
in wages. 

In a country with such a relatively brief history as the United States, any 30- 
year period is bound to be significant. Yet, 1860-1890 did include a war, a long de- 
pression, and we did cross some major thresholds toward the modern industrial age. 
It would, therefore, be of keen interest to the current scholars and practitioners in 
this field to see an analysis of the relationships of these findings to such overridingly 
important forces as productivity, size and composition of the labor force, trade union 
growth, etc. In his concluding chapter (of about 10 pages), Long does touch on these 
but any detailed investigation was beyond the scope of his volume. We join with the 
author in the hope that such analytical investigation does get to take place. 


Debits and Clearings Statistics and Their Use. George Garvy. Washington, D. C.: Board 
of Governors of the Federal Reserve System, 1959. Pp. ix, 144. Price not listed. Paper. 


E. T. Weiter, Purdue University 


TATISTICAL series dealing with the volume of check payments have long been con- 
7 sidered important in recording the ebb and flow of business conditions. Students 
of business cycles, as well as financial analysis, are indebted to George Garvy for his 
very careful review of (1) the derivation of the various check-payment series, whether 
based on clearing-house statistics or on bank debits, and (2) the attempts of eco- 
nomists and statisticians to use these series to test alternative hypotheses, 

The flow of check payments the latter part of the nineteenth and early part of the 
twentieth centuries have been, for the most part, derived from clearing house reports, 
Garvy discusses in detail the pitfalls and weaknesses of these series, showing how 
changes in the structure in the banking system, as well as changes in reporting prac- 
tices of banks and clearing-house organizations, have tended to introduce biases into 
these series. He concludes that for purposes of cyclical analysis the more compre- 
hensive debit series available since the end of World War I could be spliced to the 
clearing-house statistics, but for the purpose of trend analysis splicing is not feasible. 

Garvy then turns to a review of the efforts made by Mitchell, Frickey, Schumpeter, 
Snyder and others to develop meaningful time series for analytical purposes. He also 
considers briefly the efforts made by Frickey, Burns, Snyder, Copeland, Keynes and 
others to use these series to test hypotheses from monetary theory. 

Because of the careful craftsmanship of Garvy’s work and his comprehensive 
bibliography, scholars expecting to use historical records relating to check payments 
will no doubt make this a standard reference book. 

In general, it seems to this reviewer that more credit should go to work of this kind 
than is usual these days. In the long run, it will be work such as Garvy’s, charac- 
terized as it is by a careful derivation of statistics and attention to detail, which will 
advance our understanding of the monetary mechanism. 


Analyses of Industrial Operations. Edward H. Bowman and Robert B. Fetter, Editors. 
Homewood, Ill.: Richard D. Irwin, Inc., 1959. Pp. 485, $7.95. 


Harvey M. Waaner, Stanford University 


4 ge book is a collection of 26 articles, 19 of which first appeared in the journals 
Management Science, Operations Research, and Naval Research Logistics Quarterly. 
The stated objective is to present a series of case studies showing the actual applica- 
tion of quantitative methods for the analysis of indusirial operating problems. By 
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way of extensive coverage, the papers pertain to petroleum blending and chemical 
processing; industrial manufacturing processes; railway freight train and aircraft 
scheduling; electric power investments; warehousing and distributing seasonal prod- 
ucts; production, inventory, and employment scheduling; machine maintenance and 
replacement; congestion processes; and military logistics; several of the articles con- 
tain studies from British and French industries. The main line mathematical tech- 
niques encompassed are linear programming, stochastic inventory theory, and queue- 
ing theory. The editors are to be commended on their broad selection. 

The mathematical prerequisites for most of the articles are not slight, and the 
reader will need a good background in typical operations research topics to fully 
comprehend and intelligently evaluate the contributions. For successful use of the 
book in an advanced seminar, a situation which the editors suggest, the instructor 
must be able to explain many of the technical details which undoubtedly will be 
obstacles to understanding for a large number of students. The papers typically 
are not written for an unsophisticated reader desiring a simplified guide to applica- 
tions. 

There is some question whether the collection truly represents a compilation of 
actual case studies, as alleged. Without doubt the papers are devoted by and large 
to solutions of problems suggested by actual industrial situations. But it is not sub- 
stantiated that most of the offered solutions have been implemented, and of those 
that have, whether they have been continued and have yielded significant economic 
benefits. Consequently if the editors’ sole intent is to offer their readers a set of 
honest-to-goodness industrial applications, the book falls somewhat short of this 
goal. But if, instead, the main attempt is to convey the manner in which present day 
management scientists spproach industrial problems, the power of mathematical 
modeling, and the versatility of operations research techniques, then the task is re- 
markably well achieved. 


The Theory of Committees and Elections. Duncan Black. New York: Cambridge Uni- 
versity Press, 1958. Pp. xiii, 242. $5.50. 


Gerawp L. Tuompson, Carnegie Institute of Technology 


I" ORDER to evaluate the contribution to political thinking of Professor Black’s 
volume it is necessary to consider first the preface and Part II of the book. Accord- 
ing to the preface, the work is a highly individual contribution of Professor Black’s 
that he has been preparing over a number of years. Essentially the work was com- 
pleted in 1947 but “failed to obtain publication” at that time. The difference between 
the present version and the 1947 version is an historical section devoted to articles 
published on the subject by Borda, Condorcet, Laplace, Dodgson, and others. The 
date of the latest article included in Part II is 1907. The only other major source of 
inspiration for this study seems to be classical (Marshallian) economics. Therefore, 
as far as the influences of the literature are concerned, the present volume could 
have been published anytime after the turn of the present century. 

Among the more recent developments that would, in the reviewer’s opinion, have to 
be taken into account in any modern treatment of political theory are: the theory of 
games (especially simple games), welfare economics (Arrow’s contribution is based on 
a simple voting paradox), psychological theories of behavior, some mathematical 
disciplines (e.g., axiomaties, graph theory, set theory, matrix algebra), and recent 
statistical work on paired comparisons. The present volume could have benefited 
from some of these theories and is considerably less sophisticated than many of them. 
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For the reasons just cited the reviewer cannot recommend the book as a definitive 
work on the state of the subject. However, it does have its own merits and does con- 
tribute a good deal to political science thinking. 

The basic object studied in this book is a committee that is considering a number 
of motions, only one of which they can accept. One of the motions, ao, represents the 
existing state of affairs. Each voter has a preference schedule that rank orders the 
motions. To decide which motion is to be accepted they are voted on either in pairs 
or in toto. Usually a simple majority rule is used to decide which, if any, motion is ac- 
cepted. The objective of the book is to provide methods for deciding, giving the 
preference schedules of the voters, which measure should be adopted. 

Two primary assumptions run throughout the book. One is that each committee 
member can rank order all the motions being considered. The other is that the mem- 
ber will always vote, given two motions, for the motion that is higher in his rank 
ordering. In Chapter XVI, Professor Black mentioned some work by himself and 
R. A. Newing that does not depend on the first assumption. The second assumption is 
held throughout the book. To remove this assumption would, of course, drive the 
theory close to the theory of games. 

There are essentially two different models used in the book for the analysis of the 
problem. In the first model, the assumption is made that there is an ordering of the 
motions in which every voter’s preference schedule is single peaked. In this model, 
it is sometimes additionally assumed that the number of motions is infinite and that 
the preference function is differentiable. In the second model, no assumption on pref- 
erence schedules is assumed but merely a count is taken of the number of people who 
prefer a given motion to another one. Here the number of motions is finite and a 
matrix notation issued to record the votes. The entries of the matrix are pairs of 
numbers. Thus in the 7, jth entry there is a pair (a, 6) in which a is the number of 
votes for i over j and b is the number of votes for j over 7. Hence the entry at j, ¢ is 
(b, a). Thus the same information is recorded twice in each matrix. This is unfor- 
tunate because if the definition had been that the entry in 7, 7 was the number of votes 
for ¢ over j (i.e., a) then the entry in j, i would be the number of votes for j over ¢ 
(i.e., b). Then, perhaps, use of matrix operations might have been used successfully 
to analyze such matrices as is done in some of the recent statistical literature on paired 
comparisons. 

For the most part the study is made of the simple majority method of deciding an 
election. For cases where there is no such majority, he also considers the suggestions 
of Borda, Condorcet, and Dodgson for resolving the conflicts. Other problems con- 
sidered are: the virtues and vices of proportional representation; international agree- 
ments; complementary motions; and the elasticity of committee decisions. 

There are several weak points in the book. The discussion in Chapter III on the 
idea of a preference schedule and its justification by appeal to authority via econom- 
ics seems quite naive to the reviewer. The objections which are raised and demolished 
by saying that such things have been done for years in economics are, to the review=r, 
as much objections against economics as to the proposed theory. Chapter VII on 
cyclical majorities suffers from technical defects in that Proposition 5 is false and the 
later propositions that depend on it are therefore in need of reworking. Finally, the 
remark on p. 150 that “Politics has to take its cue from Economics” seems patently 
false to the reviewer. 

On the other hand this book raises a number of interesting questions. The problem 
stated in Chapter XV on determining the set of all schedules corresponding to a 
given voting matrix is interesting. The connection of this theory with the theory of 
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games is especially obvious in Chapter XII and a further investigation would be de- 
sirable. Questions of party structure, vote trading, coalitions, bargaining, all seem 
to be worthy of study in the political context, 

Let us hope that this book will serve as an inspiration and starting point to a 
mathematical theory of politics—doubtless that is what Professor Black would him- 
self most desire. 


International Migration: 1945-1957. International Labour O fice. Studies and Reports, New 
Series, No. 54. Geneva: 1959. Pp. xiv, 414. $4.00. Paper. 


Ricuarp A. Easteruin, University of Pennsylvania 


nis study provides a worldwide survey of postwar international migration 

through 1957. Because of its distinctive nature, causes, and effects, “political” 
migration is treated separately from economic. The discussion of political movements, 
which while on a greater scale were highly localized and largely non-repetitive in 
character, is confined to about the first third of the volume. Particular attention 
is devoted to the flows to West Germany and Israel, but the discussion ranges also 
over the European refugee problem generally, the Palestine Arab refugees, the shifts 
between India and Pakistan, and repatriation and refugee problems in the Far East. 

Part Two, “Economic Migration,” is divided into three sections, dealing succes- 
sively with the nature, causes, and effects of postwar economic migration. Section A 
takes up first the movement within continents. The discussion is organized around 
the principal countries of immigration on each continent and covers so far as possible 
both volume and sources of immigration. Attention is then turned to the inter- 
continental movements, chiefly European emigration, with the treatment covering 
volume and direction in the principal countries of both emigration and immigration. 
Section B, “Economic Migration Factors,” contains chapters on “Laws and Regula- 
tions,” “The Labor Demand and Supply Throughout the World,” and “Organized 
and Assisted Migration.” The first part of Section C considers the demographic ef- 
fects of the movements, on the size and age-sex composition of the population, on 
fertility and mortality in countries of origin and destination, and on the occupational 
composition of the labor force. In contrast to the discussion in Section A which 
covers both annual and total movements during the period, the analysis here relates 
almost wholly to the cumulative effects. The remainder of the section is concerned 
with the effects of the economic flows on the level of income and employment in the 
sending and receiving countries, and on the migrants themselves. A brief concluding 
chapter of five pages suggests some general principles for possible guidance in the 
formulation of national and international migration policy. 

Virtually all the statistical analysis in Part Two is concentrated in three chapters— 
5 and 6, relating to the volume and direction of migration, and 10, to the composition 
according to age, sex, and occupation. The underlying data include not only migra- 
tion statistics proper, but also figures on voyagers or passengers, work or residence 
permits, and even census returns. Unpublished materials in the hands of the ILO as 
well as published sources, detailed at the end of chapter 6, were employed. Of par- 
ticular value is the critical analysis, running through chapters 5 and 6, for each of a 
number of countries of the migration data. The conclusion is reached that “the 
statistics of immigration countries tend to confirm the view that the figures compiled 
by emigration countries are probably too low” (p. 204). 

This book will be of value to those who wish information on the pattern, causes, 
and effects of postwar migration, on the legal framework of the movements and 
suggestions for migration policy, and on the nature and reliability of migration 
statistics. 
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Mathematical Methods for Digital Computers. Anthony Ralston and Herbert S. Wilf, 
Editors. New York: John Wiley and Sons, Inc., 1960. Pp. xi, 293. $9.00. 


C. L. Perry, Stanford Research Institute 


ODERN general-purpose automatic digital computers greatly expand the variety 
M of analytical techniques that are practical to use. For a method to be practical 
for automatic computation it must be programmable, fit in the computer memories 
and not require too many computer operations. This book is a compendium of de- 
scriptions of computation procedures. The format of the chapters generally includes 
sections on the function (i.e. use) of the numerical method described, a mathematical 
description of the method, summary of the calculation procedure, a flow chart for 
the computer program, memory requirements, estimation of the computation time, 
and references. The description of the computation procedure does not include the 
detailed coding of the program. This detailed coding is left for the user to adapt for 
his computer. 

The twenty-six chapters by twenty-four well known authors include a comprehen- 
sive chapter on comparison of methods for computation of the elementary mathe- 
matical functions and six chapters on the inversion of matrices and the solution of 
linear equations by elimination, Gauss-Seidel iteration, conjugate gradient method, 
Monte Carlo methods, and rank annihilation. Chapters of interest to statisticians also 
include computational procedures for multiple regression, factor analysis, autocorre- 
lation and spectral analysis, analysis of variance, numerical quadrature by deter- 
ministic and Monte Carlo methods, linear programming problems, Fourier analysis, 
numerical solution of polynomial equations, determination of the characteristic roots 
of a matrix by the Jacobi method, and chapters on the numerical solution of or- 
dinary and partial differential equations. 

Most of the chapters include a sample problem, some contain error analyses, and 
a few include comparisons of computation procedures and discussion of pitfalls. The 
chapters generally stand by themselves and there is very little reference to other 
chapters. Some short proofs are given and references to more comprehensive articles 
are given. 

This book should be of most interest to readers who are just becoming acquainted 
with numerical methods and have finished one of the basic texts in numerical analysis. 
It will also be of interest to individuals who are interested in using one of the methods 
presented. The reviewer recommends that the book be included in all computation 
libraries. The individual purchaser, however, should consider it as a supplement to 
the more basic text on numerical analysis. Some readers will find that one or two of 
the authors use different terminology than they are familiar with. The editors warn 
the reader to be wary of flaws in the flow charts although many of them have been 
extensively tested. 

The format chosen for the presentations in the individual chapters is designed with 
the user in mind. When the use of machine independent compilers such as ALGOL 
become widespread, it would be well worthwhile to incorporate such programs in the 
text. 

The publishers are to be congratulated on the excellent legibility, particularly in the 
equations and flow charts. 


Statistical Methods in Biology, Norman T.J. Bailey. New York: John Wiley and Sons, 
1959. Pp. ix, 199. $4.50. 
Lincotn E. Mosss, Stanford University 


HIS is a good thin book. In 160 pages of text the author lays out an amazingly 
large variety of statistical techniques. A good heuristic justification ordinarily 
accompanies each such presentation, as does a worked example. 
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The book is directed to workers in the biological and medical sciences and, in the 
author’s words, is intended to provide them “with an elementary account of the chief 
statistical methods liable to be needed in practice.” In addition to material to be found 
in most elementary texts, there is a chapter on computations including sound advice 
on significant figures, decimal places, computational checks and computing machines 
(convincing admonitions regarding the slide rule and sums of squares appear here). 
Techniques associated with Poisson distributed data are also included. 

A novel feature of the book is a 30-page section at the end entitled “Summary of 
Statistical Formulae.” This section is formally self contained (but also keyed back to 
the text by page references); it includes 38 numbered paragraphs with titles like 
“Comparison of a percentage based on a large sample of size n >30 with a known 
standard” or “Calculation of regression coefficients,” or “Bartlett’s test for the 
homogeneity of several variances.” There is a good table of contents, a fair index, 
and an adequate minimum of tables. 

The book is indeed designed to be a cook book; it is a very good one, since it is thin, 
lucid, and virtually error-free. From the beginning to the end the reader is exhorted 
to turn for statistical advice if things are at all complicated—and to do it early. 

It is almost impossible to object to omissions from so slight a volume already sur- 
prisingly full of inclusions. However, some readers may wish that the 15 or 20 pages 
on correlation had been spent on other topics such as short-cut procedures, or non 
parametric tests, or simultaneous confidence procedures in the analysis of variance, 
or even sequential tests. However, any book embodies the author’s choices of what 
material to include, what to exclude. In this instance the choices seem well adapted 
to the audience. 

It is the reviewer’s experience that first year medical students find it readable. It 
seems to be a book very well suited to two uses. First, it should make a good text 
for instructing medical students in elementary statistics. Second, it should make a 
good statistical manual for the laboratory worker in biological science. 


Numbers from Experiments: A Basic Analysis of Variation. C. C. Li. Pittsburgh: The 
Boxwood Press, 1959. Pp. ix, 106. $4.00 cloth, $2.75 paper. 


Cart E. Hopkins, University of Oregon Medical School 


His unique little book comprises the theoretic part of a short course in experiment 
design and analysis offered by the author to “mature graduate students... 
physicians, microbiologists, epidemiologists, biochemists, and occupational and public 
health statisticians.” It fills a real gap in the armamentarium of teaching materials 
for applied statistics. Teachers faced with teaching experiment methodology to 
graduate students, experimental scientists and the like need no longer worry about 
how to give their non-mathematician students sufficient of the theory underlying 
analysis of variance techniques that they can know what they are doing in the welter 
of formulas, prescriptions and manipulations that constitute the usual course in ex- 
periment design. In 14 short chapters the internal arithmetic and mechanical rela- 
tions in the ANOVA are thoroughly developed without resort to any higher math- 
ematics. 
This book does for the non-mathematical graduate student what the Mann book 
does for the mathematical statistician. It is all done with simple algebraic derivation 
from a few fundamental identities which are given as definitions, e.g., 


ssq = } (ya — 9)*. 


After some brief introductory remarks about experiment design principles, Chapters 
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2-5 explain the properties of addition and the = notation as a basis for study of the 
properties of the sum of squares, which are developed by establishing the identity 
Z(ya— 9)? =Zy.’—ns*. This sum is illustrated by a geometric diagram with which 
readers of this Journal will be already familiar (50:(1955)1056-63). Mean and vari- 
ance of a population are defined and theorems are developed by simple algebra for 
the expected values of y, y’, 9, (—)* and s* under random replacement sampling, 
with no distribution assumptions. Then the sum of squares is subdivided into be- 
tween and within groups sums. To this point the development has been purely ab- 
stract algebra, with application to experiment problems only adumbrated. 

In Chapters 6-8 the theory of ANOVA begins with introduction of the linear 
model for expressing an observation as a sum of independent parts, 


Yia = wi + 7: + €ia- 


A lively poker game played on Grandma’s 84th birthday furnishes a vivid para- 
digm. The last square estimates of population parameters are shown to minimize 
the error sum of squares. Here brief resort is had to the differential calculus, but 
this is done behind the scenes, with only the resultant normal equations being 
given to the student. The expectations of the various sums of squares are developed, 
by algebra, for both Model I ANOVA and Model II (components of variance). 
Finally, the assumption of normal distribution in the population is introduced so 
that the variance ratio may be used to test hypotheses about the mean squares. The 
relations of F, @, xz? and u* are exhibited clearly. 

With the preliminary tools now mastered the student moves in Chapters 9 and 10 
into ANOVA itself, starting with the hierarchical classification schema of one-way 
analysis, followed by the two-way case. Throughout the exposition the expectations 
are written out, making it easy to see directly what hypotheses may be tested and 
why. The various types of inequality and disproportionality and their consequences 
are discussed. The usual partition of the sum of squares and their expectations is 
given, but with the difference that the source and meaning of the interaction term 
emerges quite clearly. 

Two chapters are devoted to linear transformations (rotation of axes) and the sub- 
division of the sum of squares into orthogonal contrasts with single degrees of free- 
dom. An exemplary set of four simple examples clearly illustrates the computations 
and the kinds of contrasts likely to be of interest including the factorial design and 
the examination of non-linear components. 

Two final chapters carry the same algebraic method of development into sums of 
products, linear regression and analysis of covariance, again invoking the calculus 
only for obtaining the normal equations. 

A briefly annotated list of references leads the student to a few of the best of the 
“general,” the “classical” and the “modern” textbooks, including the rather similar 
in approach though more comprehensive text by the author’s brother, Jerome C. R. 
Li. A table of contents, index and abbreviated F table complete the book. 

Paper and printing are of good quality, though the paper cover is not very sturdy 
for a textbook (available in boards). Errors are uncommonly rare, though we did 
find at bottom of page 12 an ny? instead of nj*. The chapters are well divided accord- 
ing to major topics, and likewise the paragraphs within the chapters. The latter, 
however, might have contributed further to pedagogical clarity by being labelled 
topically instead of numerically. 

The geometrical aids may cr may not be helpful, depending on the mental habits 
of the individual student. At any rate, the algebra is quite clear without the geometric 
visualization. 
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The notation is rigorous, consistent and complete, with simplest forms mercifully 
used where possible according to the reduction scheme: 


an 


Lta= Le = Lia Ly = '. 


At first it is disconcerting to see a instead of i as the principal subscript, y instead of 
x as the variate, Y for y, and y as an observation rather than a deviation. These are 
presented clearly, however, so that the student should have little difficulty learning 
to recognize them and to translate to the forms used in other standard texts. The 
use of ssq for sum of squares seems less happy orthographically than SS, and when 
Professor Li comes to the sum of products he is obliged to switch notation to (rz), 
(yy), and (zy) for the sums. 

The presentation uses extremely simple numerical examples, which are clearly 
artificial, but serve really to extend the generality by focussing attention on the 
algebraic relationships rather than on the computation or the empirical problem. 

This book pretends by no means to be a complete treatise on analysis of variance 
and design of experiments. In such a basic treatment the author can be forgiven for 
omitting the great variety of experiment designs readily available in other books, 
and common statistical concepts such as random sampling, probability, t-test, dis- 
tribution function, normal distribution, etc., are appropriately taken for granted or 
developed in the lectures and discussion that accompany the book in course use. One 
hopes also that the taken-for-granted part includes a grasp of the two kinds of error 
in testing hypotheses, and power of the test, for these are not mentioned. In such a 
basic exegesis it seems unfortunate that the student is given no answer to the peren- 
nial question “why square the deviations?” And why use a least square model? What 
other models would be possible, and what their merits and demerits? 

In sum, this is not an elementary statistics text, nor a procedural cookbook of 
experiment design. Nor is it an advanced treatise on theory of the ANOVA. But for 
the rather mature but non-mathematician science student it is a clear and concise 
bridge to comprehension of the internal mechanics of the ANOVA procedures. Al- 
though designed for a lecture accompaniment, the book is sufficiently self-contained 
to be tractable for self-study by an advanced graduate student, an experienced ex- 
perimenter, or a teacher of applied statistical procedures. 


Finite Markov Chains. John G. Kemeny and J. Laurie Snell. Princeton, New Jersey: D. 
Van Nostrand Co., Inc., 1960. Pp. viii, 210. $5.00. 


GueNn E, Baxter, University of Minnesota 


S THE title suggests this book is concerned with the theory and applications of 

finite Markov chains. The authors intend their book to be useful as a textbook in 

an undergraduate course in probability as well as useful as a reference book for 
workers in fields outside mathematics. 

According to the authors the book contains a “new approach” to finite Markov 
chains. This new approach presumably simplifies the treatment so that a minimum 
of mathematical background is needed. Briefly, this approach is that of defining a 
number of so-called “fundamental matrices” in terms of which interesting informa- 
tion about the Markov chains can be derived. Although these fundamental matrices 
are useful in certain investigations, they do not shed any light on one central prob- 
lem, i.e. the problem of finding the n-step transition probabilities. The computation 
of the n-step transition probabilities is usually difficult (or virtually impossible) for 
the general Markov chain. However, the computation is often very simple for finite 
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chains. It is noteworthy that nowhere in their book do the authors consider the probe 
lem of finding the n-step transition probabilities. 

Chapter I contains a brief summary of prerequisites for the remainder of the book. 
Chapter II contains the basic definitions of Markov chains, a list of examples, and 
the classification of states. The discussion of classification of states is too condensed 
to be easily understood. The authors rely too heavily on material which is included 
only in summary form in Chapter I. The reviewer is also bothered and somewhat 
mystified by the phrase “truth value” which appears prominently in the definitions 
and theorems of section 2.1. 

Chapters III through V contain a detailed investigation of finite absorbing, regular, 
and ergodic Markov chains. The fundamental matrix in the case of the absorbing 
chain is defined to be ([—Q), where Q is submatrix of the transition probability 
matrix corresponding to the transient states. The authors demonstrate an ingeniously 
large number of uses for this matrix in analyzing the behavior of the absorbing chain. 
Quantities which are expressed in terms of the fundamental matrix include absorp- 
tion probabilities, mean and variance of occupation times in the transient states, 
and mean and variance of the number of changes of state. For the regular chain 
another fundamental matrix is defined and similar uses are delineated. Chapter VI 
contains additional information about absorbing and regular chains. A number of 
useful tricks for obtaining absorption probabilities and mean first passage times are 
presented. The second half of the Chapter VI is devoted to a discussion of “lumpabil- 
ity,” ic. forming a “new” Markov chain from a given chain by combining sets of 
states into single states of the new chain. Illustrations and problems are numerous 
and help to explain the theory. However, the “Land of Oz” example is overworked. 
This example, which concerns weather prediction in the Land of Oz, is the authors’ 
favorite, and it appears in almost every section of Chapter VI. By the time the reader 
reaches the end of this chapter, he may well feel he has become the Wizard of Oz. 

The best part of the book is Chapter VII. The theory developed previously is ap- 
plied to a number of well-chosen probability models. These include the random walk 
and applications to sports, genetics, learning theory, mobility theory, and economics. 
The authors are concerned with three aspects of each example: the assumptions 
inherent in the model, its mathematical analysis, and the interpretation of the 
mathematical results. Except for one example, (mobility theory), too little attention 
is paid to the first of these aspects. On the other hand, the mathematical analysis and 
interpretation are handled very well. Particularly good in this respect is the applica- 
tion to genetics. However, even in this otherwise excellent example, the authors fail 
to inform the reader as to the assumptions concerning passage of genes from parents 
to offspring. In the learning model, there is no discussion of the assumptions which 
enable an analysis in terms of the theory of Markov chains. 

Chapter VII ig the authors’ best argument for use of this book as a textbook in a 
course in probability, and it is a good argument. The rest of the book is less outstand- 
ing and cannot be given a high recommendation. The material of Chapters III 
through VI, however, will make the book useful for reference purposes. 


An Introduction to Experimental Design. William S. Ray. New York: The Macmillan Co., 
1960. Pp. X, 254. $6.50. 


Martin Fox, Michigan State University 


pe book is intended as a text for graduate students (primarily those in psychol- 
ogy) with little or no mathematics beyond high school algebra. An attempt is 
made to present to such students some of the methodology of experimental design. 
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Real understanding cannot be imported since this requires students with (1) a small 
degree of mathematical maturity, (2) a good elementary course in probability theory, 
(3) at least one thorough year course in statistics, and (4) some knowledge of group 
theory. Instead the student is expected to have had some sort of an elementary, non- 
mathematical statistics course. 

In the opinion of this reviewer it is a mistake to try to teach statistical methodology 
to students with so little background in mathematics. About all one can hope to 
accomplish is the presentation of those methods which are in most common misuse. 
One can never develop in such students a full appreciation of the conditions under 
which the procedures are valid or even approximately valid. Instead the student is 
forced to memorize another set of formulae to place along side those imperfectly 
remembered from his elementary course. In due course, he is awarded his degree and 
told to march out from his Alma Mater to join the swelling ranks of the misusers. 
For this reason this reviewer can only wish that this book and others akin to it had 
never been written. ; 

The first five chapters are largely introductory and provide some review of ele- 
mentary statistical methods. Chapters 6 and 7 contain some basic material on 
hypothesis testing. Chapter 8 contains a discussion of the assumptions made in 
analysis of variance and the consequences of their violation. The remaining nine 
chapters provide a catalogue of some methods in design of experiment such as 
matched pairs (Chapters 9 and 12) and factorial designs (Chapters 11 to 15). 

A typical example of the shortcomings of this book is provided by the handling of 
assumptions needed in the analysis of variance. Instead of a clear statement of the 
linear model there are a few vague statements about additivity. The normality as- 
sumption and the consequences of its violation are discussed fully. Independence is 
not mentioned and the discussion of the requirement of homogeneous variances is 
inadequate. In fact, homogeneity of variances is confused with additivity. 

Level of significance and power are discussed in Chapter 7. However, the student 
is left in the dark about the existence, let alone use, of the tables and charts of the 
power of the /- and F-tests. 

Since the students for whom this book is intended are so ill prepared, the author 
was unable to present a general analysis of variance table. Instead the table (with 
little or no justification) is presented for each special case. 

The Appendix contains extremely abridged tables of the F- and X*-distributions. 
There are no problems other than the examples whose solutions sre presented. 

The great danger in books such as this is the false security they create. Too many 
students, after courses using such books, feel they know statistics. Far better for such 
students would be books and courses leading them to the recognition of statistical 
problems and to an idea of the principles behind their solutions. Let these students, 
when faced with statistical problems of their professions, seek the advice of com- 
petent statistical consultants. 


Introduction to Probability and Statistics. Henry L. Alder and Edward R. Roessler. San 
Francisco: W. H. Freeman and Company, 1960. Pp. vii, 252. $3.50. 


T. A. Bancrort, Iowa State University 


_— authors state that this book has been written to serve as a textbook for a one- 
semester introductory course in probability and statistics and has been used for 
several years before revision in mimeograph form on the Davis Campus of the Uni- 
versity of California and at some other institutions. Mathematical knowledge equiv- 
alent to two years of high school algebra is stated as a desirable background. With 
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such a limited background some theorems are stated without proof. To avoid the 
mathematical difficulties associated with limiting processes, demanding a knowledge 
of calculus, discussions have been restricted, wherever possible, to the finite case, in 
particular, all populations are assumed to be finite. It is maintained that such a 
course in basic statistics, available to all departments, should be centrally located in 
one department, which, in most schools, would be either the mathematics or statistics 
department. 

After a short introductory chapter containing a brief account of the history of 
statistics, attention is given to measures of central tendency and dispersion. A chapter 
on probability is followed by respective chapters on: the binomial distribution, the 
normal distribution, large sample theory and methods, small sample theory and 
methods through the use of Student’s ¢ in comparison of means of two groups and 
paired comparison, the sign test, regression and correlation, and some uses of the chi- 
square situation. Two short chapters are added: one on index numbers and one on 
time series. 

How shall this book be placed in relation to the numerous other introductory texts 
in statistics? The inclusion of some history of statistics is to be commended. Also the 
reviewer agrees that the course in basic statistics or rather the principles of statistics, 
available to all departments, should be centrally located in one department (statistics 
or mathematics). However, it seems to the reviewer that the raison de etre of such a 
course should lie in its emphasis on statistical inference. For a one semester course it 
is doubtful that a text can give the proper emphasis and coverage to even the ele- 
mentary statistical inference principles, and at the same time include even elementary 
mathematical proofs of numerous theorems and extensive exposition of descriptive 
statistics and large sample or asymptotic theory. As an example, the text under dis- 
cussion makes no attempt to present the principles of interval estimation and tests of 
hypotheses in connection with simple regression. Again no attempt is made to explain 
a simple analysis of variance procedure and sume of its manifold practical uses. Since 
a proper exposition of even elementary mathematical statistics, including statistical! 
inference theory and derivations requires at least sophomore calculus, it would appear 
desirable to present the introductory course, requiring only simple algebra, as a prin- 
ciples course without proofs or derivations and with major emphasis on the prin- 
ciples of statistical inference. In such case another course including derivations and 
proofs, requiring calculus as a prerequisite, could be made available to students who 
are interested and qualified. 


Statistical Analysis in Psychology and Educstion. George A. Ferguson, New York: Mc- 
Graw-Hill, 1959. Pp. 347. $7.00. 


BenJAMIN Rosner, Brooklyn College 


ERGUSON’s text is a tightly written yet lucid introduction to contemporary statis- 

tical methods. Although aimed primarily at students with minimal mathematical 
training it maintains the simplicity and directness of more mathematical presenta- 
tions and avoids the excessive description and language too frequently encountered in 
less well-written texts. 

The text lends itself to a one-semester or full-year course. The first half considers 
the essential material of descriptive statistics while the second half is concerned pri- 
marily with statistical inference. While the treatment of much of the introductory 
descriptive material is clear and concise it does not appear of sufficient uniqueness to 
warrant particular mention. In contrast with the book jacket description of the unique 
consideration of the topic of probability—“unusual in an introductory text of this 
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kind .. . ”—the treatment of the material on probability is neither so unusual nor of 
such magnitude as to merit special testimonial. But it is certainly adequate and cer- 
tainly clear. 

The text gains in distinction and quality as it moves from descriptive to inferential 
considerations. Most impressive is the treatment of the analysis of variance, especially 
the two-way classification. Here, consideration of the choice of appropriate error terms 
in fixed, mixed, and random models is particularly clear, while discussion of the prob- 
lem of unequal numbers or disproportionality in subclasses is straightforward and 
practical. 

Somewhat unusual is the discussion of statistical transformations. Not only are 
transformations treated under a single chapter heading but the chapter itself is re- 
served for a later section of the text. Although Ferguson discusses several transforma- 
tions to achieve desirable distributional properties, the greater emphasis is on trans- 
formations appropriate for the interpretation of test scores. Many instructors in 
one-semester courses may prefer to extend the initial discussion of percentiles, stand- 
ard scores, and other normative indices by requiring the reading of much of this 
later material. In a year’s sequence its placement is appropriate. 

The emphasis on educational and psychological measurement is considerably re- 
inforced by an up-to-date and generally lucid discussion of the reliability coefficient 
and the standard error of measurement. Particularly useful is the reference to Lord’s 
recent work on the error of measurement associated with the magnitude of a test 
score. Here, however, Ferguson’s presentation might be improved by a more careful 
delineation of the steps involved in determining the confidence interval for a par- 
ticular test score. In an introductory text, the equation 
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is likely to be confusing. 

An unusual feature in Ferguson’s excellent introduction to partial and multiple 
correlation, at least as far as American texts are concerned, is his discussion of 
Aitken’s method of pivotal condensation rather than Doolittle’s approach to the 
solution of multiple regression equations. This is a welcome innovation, for American 
students are not likely to find the few references to Aitken’s procedure, while they are 
apt to stumble upon the Doolittle method in a number of places. 

In an introductory text, reviewers are likely to find several statements with which 
they can take issue. So it is here. For example, on p. 24 Ferguson states that in calulat- 
ing the mean, standard deviation, etc., from grouped data the assumption is made 
that all observations are concentrated at the midpoint of the interval. This does not 
appear to be a necessary assumption, for the midpoint may be regarded as the best 
estimate of a number of observations uniformly distributed over the real limits of the 
interval. 

A more meaningful issue lies in his discussion of one-tailed vs. two-tailed tests of 
significance. Throughout the discussion (p. 135) Ferguson implies that in either case 
the null hypothesis, Ho, is an hypothesis of equality; i.e., Ho:u1=pe. If Ho is rejected, 
then the investigator may accept Hy:uiX%ue or Hitpi<pe or Hytpi>pe. “A test 
leading to the acceptance of H,:4:% 2 is known as a two-tailed test.” “A test leading 
to the acceptance of either H,:u;>p2 or He:u;<pe is known as a one-tailed test.” 
(Italies his.) Clearly the only alternate hypothesis to Ho:yi=pe is Hituirype. If 
decisions of the kind Hi:y;>p2 or Hi:y; <2 are to be made then these must be al- 
ternatives to Ho:ui<pe or Ho: py: >p2, respectively. 

The foregoing points at issue should not detract from the general high level quality 
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of the text. Introductory statistical texts will continue to appear on the market. If 
they have any reason for being, it is because of the increased clarity of their presenta- 
tion and/or their description of more current and less widely known procedures. 
Ferguson’s text fully satisfies these criteria. It is a contribution to the statistical 
know-how of the behavioral scientist and should enjoy a wide market. 


Introduction to Probability and Statistics. B. W. Lindgren andG. W. McElrath. New York: 
The Macmillan Company, 1959. Pp. xii, 277. $6.25. 


Harry WEINGARTEN, Navy Department 


HIs text provides an introduction to the basic notions and concepts of mathemat- 

ical statistics and probability theory. Calculus is a required prerequisite, with here 
and there some knowledge of what may be found in courses in advanced calculus 
needed. Formally, the book has twelve chapters and these are Chapter 1—Simple 
Probability Models, Chapter 2—Random Variables and Distribution Functions, 
Chapter 3—Discrete Distributions, Chapter 4—Continuous Distribution, Chapter 5 
—Sums of Random Variables, Chapter 6—Sampling, Chapter 7—Presentation and 
Description of Data, Chapter 8—Testing Statistical Hypotheses, Chapter 9—Testing 
Estimating Location, Chapter 10—Estimating and Testing Variability, Chapter 11— 
Comparison of Two Populations, Chapter 12—Some Further Topics. Appendix A 
deals with notation and gives a glossary, Appendix B provides a set of required tables 
for use with the text and Appendix C provides some references to other texts. The 
appendices are followed by answers to all the exercises, except for the review problems 
at the end of each chapter. 

Overall, it may be concluded that a very good job was done. The first chapter while 
sketchy is still adequate. It would perhaps be preferable to have a rearrangement of 
Chapters 2, 3, and 4. Chapter 3 could quite nicely take off from the first chapter as a 
generalization of the idea of solving individual probability problems and then move 
to the notion of a probability function. Chapter 4 could then follow with the idea 
of a density function. After this has been disposed of, the cumulative distribution 
function would provide a nice completion of the topic. Most students at this level 
would probably find this order helpful for the purpose of grasping the concept of a 
random variable in general and the subsequent ideas of density and c.d.f. The discrete 
situation would then fellow along a pattern already staked out. They are used to see- 
ing an integral and the integration of the density to obtain a probability is a pleasant 
idea. I am afraid that the appearance of the general idea of a c.d.f. in Chapter 2 may 
be too hard for most students to absorb easily. The rest ¢’ the text from Chapter 5 on 
is very nice indeed. There is a distinct modern flavor which is evidenced by the topics 
covered, for example the Kolmogoroff-Smirnov statistic, but yet full attention has 
been given to the important fundamental ideas contained in the central limit theorem 
and the Neyman-Pearson lemma. All the ideas are clearly set forth and the number 
and kind of illustrative examples are excellent. This text was used in a course I 
taught to engineering students at George Washington University, but would certainly 
be useful as a text at the undergraduate level for students intending to go on to 
graduate work. I would like to add that having taught other courses in statistics at 
a very elementary level, it was refreshing to see in Chapter 7 of the text, that the 
quesiion of selecting class boundaries when preparing a frequency table from a set of 
raw data, was nicely covered and given all the attention it deserves. 

Some of the errata found in the book are as follows. Page xii, second line from the 
bottom should be changed from F.95 to F.99. Page 7 last line, change (n-k!) to 
(n-k)!. Page 11, top, p and g have been interchanged. Page 38, in problem 2-10 we 
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find 1-3/4e7. The intent is to have 4e” in the denominator of a fraction whose nu- 
merator is 3. As printed this will not be clear to students. Page 41, top line, change 
(|X| <2) to (| x| <<Vz). Page 57, middle of page, change e*, the coefficient of a 
sum of two terms, to e~”*. Page 79, line 20, the letter “e” has been left off the end of 
the word “value.” Page 138, in the third line, a fraction is set equal to 1.28. It should 
have been —1.28. The subsequent answers were calculated correctly for the positive 
value. However, since this was wrong, we find the correct answers instead are n=78 
and k=41. The answers to the problems also contain a few errors. I will give the 
problem number in parentheses and the correct answer, which should replace that 
given in the text: (1-25), 88/4165; (2-7), F(x)=0 if r<—1, F(x) =}(z+1) if 
—l<2<l, F(x) =1 if z>1; (2-8), .75, .5, .45, 1, 0, 7; (2-10), (3/4)e~, 3, 2, 0, 1-(3/4)e>; 
(2-16), 0, 4, log 2, log (3/2); (3-21), 30, 35/12; (3-25) 9.7; (3-31), .00115, 20; (3-34), 
.9606, .9994, 4; (4-1) (b) 3/4; (4-9), w/2-1, w-3; (5-11) (b) .7286. There are probably 
more. 


Statistics: With Applications in Management and Economics. Earl K. Bowen. Home- 
wood, Illinois: Richard D. Irwin, Inc., 1960. Pp. xi, 415. $6.50. 


ZANE M. Powemis, [llinois Public Aid Commission 


His book is designed to serve as text for an introductory course for students in 

business administration, industrial management, and economics, and, according 
to the author, contains subject matter for a course up to five semester hours in 
duration. It is composed of a preface, 16 chapters, and an index. 

The first chapter serves as an introduction and the last chapter includes the usual 
tables of logarithms, areas and ordinates of the normal probability distribution, and 
square roots of numbers up to 999. The second chapter is devoted to basic calcula- 
tions including ratios and percentages, significant digits and rounding, logarithms, 
and the use of the slide rule. The third chapter covers data collection, and sources 
of data, and touches lightly upon some methods of collection, usually found in intro- 
ductory texts. The next three chapters deal with tabular presentation, graphic pres- 
entation, and averages and variability in that order. Chapter 7 contains what is de- 
scribed in the preface as part of new material—Interpretation, Implication and Anal- 
ysis. Chapters 8 through 11 discuss index numbers, trends, seasonal variation, and 
cyclical fluctuations. Regression analysis is covered in Chapter 12. Sampling inspec- 
tion, sample survey principles, theoretical probability distributions and sampling 
distributions constitute the contents of the last four chapters. 

The chief purpose of this review is not so much to criticize Professor Bowen’s book, 
as to question the meaningfulness of the contents of this and other similar books, and 
to raise the question, “what should the contents of an introductory text in statistics 
be?” 

Most of the recent output in introductory texts of this type has tended to be more 
of an assemblage of techniques rather than a presentation of the fundamental con- 
cepts of what constitutes statistics. Generally, they are a how-to-do-it lot rather than 
what-it-is. 

With this method of approach to statistics, the student is likely to acquire some 
knowledge, but of a mechanical sort. It is questionable whether he has learned very 
much in terms of understanding of statistics. The development of a point of view, 
a manner of logical thinking and reasoning, is far more crucial than the mere acquisi- 
tion of a smattering of knowledge about techniques. 

A topic that is often reduced to terms of techniques is analysis of economic time 
series. There is a serious question in this reviewer’s mind about the desirability and 
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advisability of including, at the introductory stage of statistics, trend analysis and 
the measurement of cyclical fluctuations. This is not to say that the economist 
should not engage in time series analysis or that such methods should not be taught 
to students in economics. What is contended here is that they are not a part of the 
introductory course in statistics. Trend fitting, whatever its value, or the use of the 
National Bureau’s indicator analysis, for example, may be taught to advantage only 
to students with substantial background in both economics and statistics. These 
methods are hazardous even at the hands of the expert. 

One cannot be altogether out of sympathy with the author for including in the book 
a chapter on basic calculations. Most anyone who has taught classes in elementary 
statistics recalls only too well that the chance mention of the words logarithm or 
reciprocal is guaranteed to evoke on the part of some students each term the be- 
wildered cry, “what is that?” Also, the division of an integer by a decimal all too fre- 
quently results in a variety of quotients. Perhaps one is no more obliged to teach the 
setting of the decimal place in a course in statistics than he is to teach spelling. The 
preparation of a handbook or a manual embodying the necessary arithmetic opera- 
tions essential in statistics may be the answer. It will certainly avoid the cluttering of 
statistics books with principles and methods that should have been learned prior to 
admission to the course, or which the student should be able to learn, or review, on 
his own. 

Some final comments on the arrangment of the book and completeness of treat- 
ment: A more unified approach to data collection would have been achieved if the 
principles governing data collection were discussed in the same chapter describing 
the various methods employed. The same is true with respect to probability. Chapter 
13, devoted to sampling inspection, contains a section on elementary chance calcula- 
tions, while Chapter 15, dealing with theoretical probability distributions, develops 
a probability concept. The introduction to probability might have been more com- 
plete if the two relevant sections were combined in one chapter. 

The new material that has been developed in this book, according to the preface, 
consists of Chapter 7, Interpretation, Implication and Analysis, and of the introduc- 
tion of the properties of probability distribution by the use of the isosceles distribu- 
tion in Chapter 15. Chapter 7, however, contains only a faint glimmer of its poten- 
tialities, and should have been greatly expanded. Had the theme suggested by the 
title been developed, the author would have made a positive contribution to the 
teaching of statistics and would have enhanced the value of the book. 

The isosceles distribution is a novel method of introducing probability distribu- 
tions. It is a frequency polygon superimposed over the histogram of a symmetrical 
frequency distribution with the shape, naturally, of an isosceles triangle. This 
“smooth” curve forms a model for easing students into the normal curve concept. 
Whereas in a normal distribution the intervals between z and # are measured in 
terms of the standard deviation, in the isosceles distribution approach such intervals 
are measured in terms of the range. Whether this method leads to a simpler and easier 
introduction to the concept of theoretical probability distributions, or contributes 
further to the confusion, remains for those who have tried it to say. 


Statistics in the Making: A Primer in Statistical Survey Method. Mary Louise Mark. 
Columbus, Ohio: The Ohio State University, 1958. Pp. xxv, 436. $5.00. 


Joun M. Firestone, The City College of New York 


g spove book, written by a sociologist, fills a serious gap in the textbook literature for 
students of statistics. Whereas most textbooks in this field tell the student about 
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sample designs and what to do with data that have been collected, too little atten- 
tion has been paid to the design and organization of a statistical study. 

The author, in her preface, states: “Some students know they need this type of 
instruction (the production phase of statistical work)—particularly those preparing 
for administrative work ... other students with no awareness, thus no felt want, 
and no statistical equipment except courses in the analysis of tabulated results, elect 
to explore statistical aspects of problems in their field of interest at firsthand, only to 
find themselves fumbling painfully and wasting effort and time they can ill afford.” 

A listing of the topics covered in this book best demonstrates the order of thinking 
and the approach to organization of a statistical study. The principal topics discussed 
include: 

The subject of Inquiry; Planning the Report. Here the orientation of the study is 
discussed, the formulation of the purpose, delimitation of the subject, preliminary 
exploration including both library and field research. 

The Substance of Inquiry—the problem of availability of materials for inquiry, 
informant competence, interviewing technique and interviewer qualifications. 

Sampling is introduced with the query “What can we afford?” The logic of sam- 
pling together with a non-mathematical description of the more common methods of 
sampling and a discussion of bias and error are the topics covered. 

Tabular presentation includes a discussion of types of statistical tables, classifica- 
tion, and table structure. 

There is a full discussion of the schedule, its function, design, and instructions. 

The collection of data by interview, including making the contact, eliciting card 
recording information and the problem of self-verification, and the use of the mail 
questionnaire are fully described. 

The use of punch cards and various tabulation systems, tabulation procedures, 
and the derivative table both as to form and the calculation of commonly used ratios 
are treated before the reader is introduced to the preparation of the report, including 
the writing of the report, the techniques of writing as well as the report’s form. 

The last part of the book deals with the statistical organization and its personnel 
together with suggested methods of administrative control. 

The organization of the book, the topics treated, and the emphasis on the pitfalls 
in the way of the unwary researcher clearly show that the author has had a wealth 
of practical experience in the organization of research. The lucid style and exposition 
indicate the accumulation of experience in teaching these methods to the student. 

The statistics teachers in our colleges and universities might well give thought to 
rounding out their courses in statistics with a course in this subject matter, and they 
could do worse than use this book as its basic text. 


Elementary Statistics. Sidney F. Mack. New York: Henry Holt and Company, Inc., 1960- 
Pp. ix, 198. $4.50. 


Jutes Josxow, City College of New York 


{ex text is a lucidly and carefully written presentation of most of the topics of 
elementary statistics covered in an introductory course where the emphasis is on 
inductive methods. The book was written for the three-credit one-semester course in 
elementary statistics which freshmen and sophomores in the nontechnical curricula 
at Penn State University can elect to help fulfill their science-mathematics require- 
ment. In consequence, it is the declared intention of the author to teach the student 
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some mathematics around a framework of statistics. In this reviewer’s opinion, the 
author has achieved almost the converse. The student is taught some statistics 
around a framework, albeit extremely limited, of mathematics. 

The limited scope of the mathematical treatment of most topics is recognized by 
the author and might best be summarized in his own words (p. 6): “Because of the 
limitations in the required mathematical background of the student [high school or 
elementary school algebra] it is hardly possible to examine here even elementary 
concepts to much depth. ... Many of the theorems are presented without proper 
mathematical proof, simply because an understanding of their proofs entails several 
years of college mathematics, particularly calculus. In order to go along with the 
spirit of the development of the subject matter, the student will have to accept on 
faith the validity of many of the theorems.” 

The book contains 10 chapters, 4 appendixes, 4 tables, and a large number of brief 
exercises at the close of each chapter. After a brief introduction the student is given a 
limited review of selected topics in elementary algebra. A brief second chapter is 
devoted to the statistical description of frequency series. This is followed by an 
overly long chapter—considering later applications in the text—on the fundamentals 
of probability. The remainder of the text is devoted to discussions of the normal, 
“Student,” and chi-square distributions; statistical estimation and tests of hypoth- 
eses as applied to the mean and to ratios; and linear correlation and regression. The 
student is offered an excellent set of challenging exercises at the close of each chapter. 
Significantly, however, these relate almost exclusively to application of the methods 
discussed. Rarely is the student required to perform the type of mathematical 
manipulation that one might expect given the author’s intention to teach mathe- 
matics. A large number of the problems relate to technical] applications of the subject 
matter. 

Most of the proofs of the theorems employed in the text are of an intuitive or of an 
“it can be shown” type. Algebraic transformations of the standard deviation and of 
the correlation and regression coefficients as well as the derivations of the mean 
and standard deviation of the binomial variate are presented in the appendixes. A 
rather long and involved derivation of the least squares normal equations is given 
in the chapter dealing with regression. It is unfortunate that the author did not take 
advantage of his prior use of the covariance in the definition of the correlation co- 
efficient to demonstrate the derivation of the least squares regression coefficients by 
application of the assumption that the residuals around the line are uncorrelated 
with the “independent” variable. Such a derivation requires no more mathematical 
sophistication than that required for the method shown and has the added advantage 
of acquainting the student with an important characteristic of least squares regres- 
sion. 

While most texts that attempt to summarize briefly and palatably the essential 
facets of elementary statistics tend to be characterized by a fuzziness and lack of 
rigor in definition, such cannot be said for this work. The author is to be commended 
for his careful formulation of the matter he has chosen to deal with. It might have 
been better, however, if he had added a few pages to his brief treatment to advise 
the student more thoroughly concerning the limitations of the methods discussed. 

The nontechnical students at Penn State who use this text undoubtedly receive an 
excellent introductory course in inductive statistics while at the same time satisfy- 
ing their science-mathematics requirement. One cannot help wondering, though, 
whether modern day educational needs might not be better served if such students 
were required first to expand their very limited knowledge of mathematics per se. 
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Methods of Correlation and Regression Analysis. Linear and Curvilinear. Third edition. 
Mordecai Ezekiel and Karl A. For. New York: John Wiley and Sons Inc. 1959. Pp. xv, 
548. $10.95. 


Roger Miter, University of Wisconsin 


pmrrers of Ezekiel’s “Methods of Correlation Analysis” (second edition, 1941) 
A will be reassured to hear that the book retains its basic excellence and is signif- 
icantly improved in this third edition. In the latter part of this review I will offer a 
brief comparison of the editions, but I feel compelled to discuss the third edition on 
its own merits for the benefit of those who, like myself prior to undertaking this re- 
view, have never had occasion to be exposed to the earlier version. 

The book begins with a brief but lucid review of elementary statistical description 
and inference and quickly introduces the notion of functional relationships among 
variables. Regression analysis is launched in the bivariate case and extended to the 
multivariate case with a full discussion of the distinct problems which arise in the 
latter. The temptation to carry over interpretations of coefficients from the bivariate 
to the multivariate case is adequately squelched in the process. In each case, an 
unusually complete discussion of curvilinear regression follows the more standard 
presentation of linear regression. The notions of sampling variability and degrees of 
freedom are continually emphasized throughout, despite postponement of the dis- 
cussion of tests of significance on the regression and correlation coefficients to a later 
section. Of particular interest to economists is a chapter on “The use of error formulas 
with time series.” A section on miscellaneous methods contains a chapter on the rela- 
tion between one variable and two or more others operating jointly, a chapter on 
regression where qualitative variables are involved, a chapter on “Cross-classification 
and the analysis of variance,” and a brief treatment of regression applied to systems 
of simultaneous equations. The latter chapter by Fox contains a nice description of 
the identification problem and a comparison of the estimates of the coefficients of 
structural equations with the estimates derived from least squares directly applied 
to single equations. In the course of this discussion, Fox neatly raises the issue of the 
relative relevance of recursive as opposed to fully simultaneous equation systems. 

No single text appeals in all respects to all potential users, of course, and the pro- 
liferation of texts gives ample opportunity for supplementation. We should be ex- 
ceedingly grateful to Ezekiel and Fox, therefore, for providing us with a text which 
does an extremely thorough job of presenting the technique and rationale of Jeast 
squares and freehand graphic methods of regression analysis, especially with respect 
to curvilinear regressions, since most texts slight these topics. 

The authors are not afraid to express their views on controversial issues. While I 
did not always agree with the authors I did not feel that their positions were un- 
reasonable, nor were their presentations of the issues unfair. It was seldom that I 
could find a statement that was downright misleading or wrong, and in these cases it 
was usually not clear whether the error should be attributed to the printer or to a 
slip of the authors’ pens—it never seemed appropriate to attribute it to a misconcep- 
tion of the authors. The point is that Wiley and the authors have produced a book 
which is remarkably free from errors of any kind. The extreme clarity of the writing 
has kept awkward or ambiguous phraseology to a minimum. The authors have labored 
to get across in a common sense manner the essential meanings of the techniques em- 
ployed in and of the results of empirical research. In order to avoid over-interpreta- 
tion or misinterpretation, they take pains to iterate their cautionary statements, but 
this repetition is never obtrusive or boring. 

The most attractive single feature of the text to me was the repeated expression 
of the role of theory as a guide to empirical research, especially the emphasis on the 
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necessity of logical bases for variables included in regressions and for the forms of the 
regression equations that are fitted. The uselessness of applying techniques based on 
elaborate statistical theory to problems for which the investigator has no theory 
relating to the subject matter is amply stated. 

The authors have written a text that purports to require “only a modest level of 
mathematical training” on the part of its readers—essentially this is intended to be a 
“non-mathematical text.” Because of their intelligence and integrity, the authors 
have not permitted this to stand in the way of correctness or completeness, as un- 
fortunately happens with so many “non-mathematical” treatments. It is, no doubt, 
because little mathematical background is assumed for the reader that the authors 
are at some pains to spell out the meanings of algebraic manipulations and results, so 
that they also avoid the excessive terseness which characterizes mathematical treat- 
ments. One might have some reservations, however, as to just how simple the algebra 
that ts included will seem to the student. It appears to me that anyone capable of 
following the algebra that is presented (and of really understanding the verbal ex- 
planations) should be capable of grasping the essential features of matrix algebra. 
The increased simplicity of presentation that would result should certainly have 
been worth the introductory chapter this would entail. The computational tech- 
niques developed by Friedman and Foote, which are presented in the middle of 
Appendix 2, could then be integrated with the text and the laborious working through 
of examples by the Doolittle method thereby abbreviated. In spite of this, the book 
deserves wide use as a text (primary or supplementary) because of its basic common 
sense and clarity and because of its full treatment of important techniques that are 
not elaborated elsewhere. (Those who wish to omit these portions of the text dealing 
with freehand curve fitting or with “shortcut graphic methods” will find that this is 
quite easily accomplished.) 

As compared with the 1941 edition, the book is considerably improved (the new 
preface generously attributes a major role to the junior author in this revision). 
The material has been rearranged, chapters are grouped into sections to indicate the 
structure of the book, the treatment of the analysis of variance in relation to regres- 
sion problems has been extended, the treatment of error formulas for time series has 
been revised, and the treatment of statistical inference and uses of standard errors 
has been revised, particularly with respect to small sample interpretations. Although 
it merely introduces the subject, the new chapter on “Fitting systems of two or more 
simultaneous equations” is excellent. Modernization throughout the text has altered 
the notation (S replaces o for the standard deviation of a sample), has revised the 
phraseology—*“Let us suppose that 15 more samples had been made, each from 20 
farms selected at random, ...” is replaced by “Let us suppose that 15 more samples 
had been taken, each of 20 farms selected at random, ...,” and has recognized the 
technological progress that has occurred in agriculture since the prior edition (the 
average corn yield per acre for the sample in the first example in the book has in- 
creased by 10 bushels per acre, or 334%—the example is remarkable in that every 
farm in the sample shows exactly the same absolute amount of increased yield be- 
tween editions). 


Sampling in a Nutshell. Morris James Slonim. New York, New York: Simon and Schuster, 
1960. Pp. xiii, 145. $3.50. 


Harowp Nissetson, U. S. Bureau of the Census 


HIs small volume is an expansion of a layman’s pamphlet prepared by the author 
for the U. S. Air Force, and later reprinted in this Journal (52 (1957), 143-61). 
To the original material there has been added some further discussion, including the 





194 AMERICAN STATISTICAL ASSOCIATION JOURNAL, MARCH 1961 


subject of sample size, and some material on quality control charts and acceptance 
sampling and almost half the book is now devoted to applications and case histories. 
The style is similar to that of the original paper, and makes entertaining as well as 
informative reading—a difficult task for an author. 

Most of the case material is very good, but not all the additions are of equal value. 
The language of some of the discussion may lead the reader to confuse sampling 
variability and bias. The author now includes as sampling a number of cases in which 
the essential problem is one of forecasting or prediction. Considering the confusion 
there has been in separating problems of sampling and forecasting, this is to be de- 
plored. Again, the examples in marketing research are devoted to problems and 
strategies in product development and testing rather than to sampling as such. 

These points do not seriously detract from the value of the book. It should be very 
useful, as was the original pamphlet, for conveying the elements of sampling to per- 
sons who are not technicians (or not even statisticians) but have occasion to use 
sampling or participate in sampling projects. The case histories may also be helpful 
in persuading administrators of the practicability and value of sampling by the ex- 
ample of large and respected business as well as government. The author has suc- 
ceeded very well in his objectives. 


Practical Business Statistics. Frederick E. Croxton and Dudley J. Cowden. Englewood 
Cliffs, N. J.: Prentice-Hall, Inc., 1960. Pp. xx, 701. Trade, $10.35. Text, $7.75. 


Ricuarp C. Hensuaw, Jr., Michigan State University 


~ subject matter of this book is entirely conventional, although the emphasis 


in some instances is substantially different from that of any other elementary 
business statistics textbook with which I am familiar. It is a smorgasbord of statistical 
techniques, many of use in business and economics and others of only peripheral 
interest. 

Its strongest point as a textbook of business or economic statistics is the analysis 
of time series; in this regard it is better than any of its competitors, to the best of my 
knowledge. But, although some of the other materials in the book are superb, it has 
so many serious deficiencies that its overall value as a business statistics textbook 
is severely compromised. 

The treatment of statistical inference and quality control is the most lengthy that 
I am aware of in an introductory textbook on business statistics. In addition to a 
specific chapter on the latter subject, there is throughout a heavy emphasis on quality 
control problems. It is true that many of the business statistics books of the past 
decade have contained a brief introduction to statistical quality control, but the 
emphasis in this book is far out of proportion to the topic’s importance in educating 
students of business or economics. 

One can surmise that Prof. Cowden, who revised the book, naturally drew upon 
the materials that he had accumulated in the process of writing his book on statis- 
tical quality control. He perhaps justifies this emphasis on the grounds that although 
quality control is a peripheral area of business statistics, it is a rich field for applica- 
tions of statistical inference. Some business statisticians may hold that the first 
part of my last statement is debatable, but the latter part is incontrovertible. 

Although two chapters are devoted to the concepts and methods of computing 
index numbers, the book contains no description of the many well-known business and 
economic indicators which are synonymous in many people’s minds with the term 
business statistics. 

The difficulty of the material is quite uneven: about one-fourth of it is too ele- 
mentary for a college course, one-half is at about the right level, and the remainder is 
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too succinctly presented for undergraduate students in business administration or 
economics. Furthermore, there is a considerable amount of unnecessary repetition. 
If the parts of the book that are too elementary were upgraded and some of the 
repetition and material on quality control and statistical inference were eliminated, 
especially the overly-concise parts, the book could be shortened by 200 to 300 pages 
or even more. This would be highly desirable, because as it now stands it is more 
like an encyclopedia than a textbook. Since there are no homework problems, the 
unfortunate student whose professor adopts the book will undoubtedly be forced to 
buy a workbook in addition to a very large, expensive text. 

There are no elementary business statistics textbooks on the market that contain 
a modern treatment of the subject and the situation is not improved by the pub- 
lication of this book. A well known modern treatment of the subject at the graduate 
level is Schlaifer’s Probability and Statistics for Business Decisions. 

The only places that the existence of any of the new business statistics techniques 
are even hinted at in the book under review are in Chapter 3: “Uses of Statistics in 
Business,” and in a small section of about one page entitled “Statistical Decisions” 
(pp. 350-1). The former is so superficial that it is useless and the latter is so short 
and inept that it would have been better not to have included it at all. In the last 
sentence on page 351, Professor Cowden writes “Scientific statistical decision making 
is at present, therefore, primarily of theoretical interest.” It is my impression that 
it is the extent of its contribution to business and economic theory that is debatable, 
while the pragmatic value of many of the techniques of statistical decision theory is 
incontestable. Mathematical programming, queuing theory, and production and 
inventory control techniques are some of the better known applications of dem- 
onstrated practical value. 

In his statement quoted above, Professor Cowden is no doubt referring to the 
theory of games or mathematical strategy and what has evolved therefrom. Admit- 
tedly this area of statistical decision theory is primarily conceptual. In a first-class 
college education what can we possibly teach that is more important than concepts 
and theory, however? 

After patiently reading 38 chapters and some 600 pages, I came to the last chapter; 
the author had promised in the preface that the idea of a mathematical model, and 
its relation to the method of analysis, would be extensively treated here. Here at 
last, I thought, the new work in simulation of business systems via mathematical 
models would be covered. Instead, it turned out to be an excellent chapter on analysis 
of variance and chi-square tests. Likewise tabulating machines are illustrated and 
described in chapter 10, but electronic computers are not mentioned. 

Finally, although the book contains some footnote references, it has no bibliography 
and, unsurprisingly, it contains its full quota of the typographic errors to be expected 
in the first printing of a new edition. 














































Modern Sampling Methods: Theory, Experimentation, Applications. Palmer O. Johnson 
and Munamarty S. Rao. Minneapolis: The University of Minnesota Press, 1959. Pp. 87. 
$4.00. 










Harowp Nissextson, U. S. Bureau of the Census 


HE authors of this short monograph have set themselves the task of presenting to 
workers in the social sciences the theory for multi-stage sampling of a finite 
population, illustrating the theory with experimental sampling, and presenting rec- 
ommendations for applications of the theory. They have not done well on any of these 
three counts. 
The first two chapters present formulas for the simple unbiased estimate of the 
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mean per element in the population sampled, the variance of that estimate and an 
unbiased estimate of the total variance, for a variety of multi-stage designs. Variety 
is obtained by specifying different combinations of methods in selecting the sample 
units at the different stages. The methods considered are simple random sampling 
without replacement (R), sampling with probability proportional to size (P), 
systematic random sampling (S) with S preceded by at least one stage with R or P, 
and the taking of all k-th stage sampling units within the (k—1)-th stage units selected 
for the sample (C). The derivation of the formulas presented in the text is discussed 
in a mathematical appendix. The principal tools are conditional expected values 
and variances and a generalized notation. 

In Chapter 3 the authors define a linear cost function for a multi-stage design and 
its expected value. They note that optimum allocation of sample among the stages 
could be arrived at by minimizing the variance of the estimate for a fixed cost, but 
do not give explicit methods or formulas for this and they make no further reference 
to the concept of optimum allocation. 

Chapter 4 is devoted to examining eight three-stage designs for a population of 
22,209 high school juniors in 432 Minnesota public schools during 1953-4. The 
schools are divided into four types: 70 junior-senior high schools, 61 senior high 
schools, 82 four-year high schools, and 219 six-year high schools. The types are taken 
as the first stage sampling units (s.u.’s), schools as the second stage s.u.’s and students 
as the third stage s.u.’s. Two variables are considered: the student’s score on the 
American Council on Education Psychological Examinations for College Freshmen 
and his total score on the Cooperative English Test of the Educational Testing 
Service. In the author’s notation, the eight designs are: (1) RiR2Rs, (2) PiP2Ps, (3) 
P,P Cs, (4) CiP2Cs, (5) CiR2Cs, (6) RiP2Cs, (7) PiP2Ss, and (8) CiP2S;. Here the 
letters R, P, S, and C have the meanings indicated above, and the subscript on a 
letter denotes the stage to which it applies. Thus, C, denotes stratified sampling of 
schools (a fact never pointed out by the authors). Fixed sampling rates are specified 
for each stage with R, P, and S regardless of the design: 50 per cent for the first stage 
and 10 per cent for the other two. Thus, designs 1, 2, and 7 represent an over-all 
sampling rate of 4 per cent of all students; design 8 a rate of 1 per cent; designs 3 and 
6 a rate of 5 per cent; and designs 4 and 5 a rate of 10 per cent. Data are given for an 
actual sample selected using each of the eight designs, and the computations shown 
for obtaining the estimate of the mean score and an estimate of its variance. For all 
except designs 7 and 8, the authors also compute the exact variance of the estimated 
mean by computing the variance due to each stage using all the data for the popula- 
tion. The authors then rank the eight designs on the basis of “accuracy of estimation 
and efficiency.” The principles by which this ranking was established are not ex- 
plained, but they apparently do not take into account differences in sample size 
between the designs. 

At the end of Chapter 4, and in the final Chapter 5, the authors set forth their 
conclusions and recommendations for the practical application of the theory they 
present. 

What have the authors accomplished of the task they set themselves? 

In the modern literature of sampling, the central problem of sampling theory is 
taken to be the design of sampling and estimation procedures so as to maximize the 
amount of information per dollar within existing administrative restrictions. This 
problem provides the underlying motivation of all five sampling texts cited in the 
authors’ bibliography. Yet, apart from noting the possibility of optimum allocation, 
the authors have not dealt at all with this central problem. As a result, the monograph 
is essentially a cookbook of variance formulas rather than a guide to the design of 
efficient sampling and estimation procedures. Because of this, too, most of the authors’ 
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observations tend to be meaningless. For example, the first one of their “three most 
significant recommendations” for reducing sampling variance and “increasing the 
efficiency” of a sample design consists of the observation that any stage for which the 
sampling rate is 100 per cent (Method C) does not contribute to the variance, and 
the recommendation of “judicious” use of C. It must be conceded that the observa- 
tion is technically correct. In fact, in the absence of any consideration of cost- 
efficiency the only possible objection is that the authors were too conservative in 
their recommendation. 

The authors have written explicitly the variance of the simple unbiased estimate 
of the mean for more multi-stage designs than anyone else, but few of the designs are 
worth considering. The problem of estimating components of variance is not treated, 
nor is the possibility of estimation procedures other than the simple unbiased estimate. 
Stratified sampling appears, not as a technique for use in combination with other 
techniques, but as a limiting case of multi-stage sampling in which the strata are 
first stage units and are sampled at 100 per cent. As an exercise in mathematics the 
monograph is equally disappointing. There is nothing essentially new. The definitions 
and notations are poorly explained, sometimes non-standard, and the proofs incom- 
plete. There are a number of misprints in the text. 

Perhaps the only real value of the monograph lies in the numerical data presented, 
which a competent person teaching a sampling course might adapt for purposes of 
illustration. 

The tone of the monograph is well illustrated in the Preface where the authors, 
with belated prescience, “foresee that these multi-stage sampling models will be of 
special value in large-scale investigations, such as national sample surveys.” The use 
of the future tense in the statement, I am sure, does not represent the awareness of 
the authors as to the present state of the art. It does accord, however, with the 
naive tone throughout. 

It is to be regretted that the senior author, a well-known social scientist and statis- 
tician, did not live to revise this monograph. It does not do him justice. 


The Demand, Supply, and Price Structure of Eggs. M. J. Gerra. (Tech. Bulletin No. 1204) 
Washington, D. C.: U.S. Government Printing Office, 1959. Pp. iii, 157. $.50. 


Antuony P. Stemperger, North Carolina State College 


HE report is divided into three parts: supply and demand factors for eggs, statis- 

tical models of the egg economy, and seasonal and price variations in the egg 
sector. The first part analyzes in some detail the economic and physical factors that 
influence supply and demand of eggs. 

The main purpose of the first section is to show that current egg prices influence 
current production; that is, that the egg sector is simultaneously structured and, 
therefore, a system of simultaneous equations is appropriate to use in estimating 
structural coefficients. Although the argument for simultaneity is lengthy, one gets 
the impression at times that the author is actually trying to show non-simultaneity. 

There is a good discussion on demand for eggs, and results from previous studies 
using cross-sectional data are analyzed. The conclusion reached is that perhaps 
tastes and preferences have as much influence on egg consumption as do prices and 
incomes. One wonders if the author really believes this, since these factors are not 
mentioned in the statistical results which follow. Included also is a brief discussion of 
short- and long-run supply curves and the factors influencing each. 

The second part of the report deals with the statistical model of the egg economy. 
An eleven-equation model is developed and solutions obtained for two forms using 
the method of limited information. Model II differs from Model I in that a number 
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of predetermined variables in the first model are excluded in the second. For each 
model, estimates of structural coefficients are also obtained by the method of least 
squares. This is somewhat surprising considering the effort expended in part I in 
developing the thesis that a system of simultaneous equations is the appropriate 
statistical technique. Perhaps this is done because traditionally researchers use both 
techniques. 

One of the variables used in the supply equations is egg-feed ratio which pur- 
portedly influences number of hens on farms during the year. This variable is not too 
well suited to the purpose; a much better relation would have been what one research 
worker calls the “egg value per hen-feed ratio.” An annual average egg-feed ratio is 
used and this aggregation becomes critical. It is stated that the number of layers sold 
or consumed on farms where produced throughout the year is related to the egg-feed 
ratio prevailing during that year. Actually, of the number of hens laying on January 1 
of a given year practically all are sold during that year regardless of the egg-feed ratio. 
This phenomenon is due to the physiological make up of the hen. The egg-feed ratio 
may influence the time of year during which layers are sold, however. An annual 
egg-feed ratio would not be expected to be significantly correlated with annual egg 
supplies. 

A lengthy argument is also made for meats, fish, and cheese as egg substitutes. 
Although these may be protein substitutes, they do not appear to be egg substitutes 
in the minds of homemakers who think of eggs primarily as a breakfast food, as a 
recent study of homemakers’ opinions of egg use shows. 

The structural coefficients obtained both by least squares and by limited informa- 
tion are somewhat disappointing. The four estimates of elasticity with regard to 
price range from —0.09 to —1.96 and none differ significantly from zero at the five 
per cent level. Based on the statistically most significant coefficients, a price elasticity 
of —.04 is obtained. Selection of this particular elasticity, however, depends on using 
one’s results to dictate the model used, a procedure unpalatable to most researchers. 

Price elasticities are compared with results of studies using cross-sectional data. 
Such studies show price elasticities higher than those obtained with time series data. 
These are summarily dismissed with the statement that this is to be expected since 
cross-sectional data bring in size and quality differences. In the opinion of this re- 
viewer, it is probably more realistic and important to pursue studies of demand by 
use of cross-sectional data rather than to continue to proliferate time series analyses, 
all of which are so similar that comparable results must be expected until someone 
develops new models or a new methodology. 

The final section of the report presents an interesting summary and analysis of 
seasonality factors in egg production, egg price variations, futures market prices for 
eggs, and government egg programs in effect since the 1930’s. An appendix describes 
how the simultaneous models were fitted to the data, shows how to derive reduced 
forms of the equations, and presents a test for non-linearity of functions. 

On the whole, this report is a comprehensive study which will be of value to 
workers in the poultry industry and to econometricians generally, even though the 
statistical] models of the egg sector add little to our present knowledge of egg demand 
and supply. 


The Compilation of Manufacturing Statistics. Frank A. Hanna. Washingt~~: U. 8S. Depart- 
ment of Commerce, 1959. Pp. xiii, 232. $1.75. 


MicHaet Gort, University of Chicago 


0 cLass of economic data has been used as intensively in research as statistics on 
American manufacturing. Indeed, some have argued that study of the manufac- 
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turing sector has consumed a disproportionate share of intellectual resources so that 
we are in danger of gaining a biased view of the structure of the American economy. 
This circumstance is largely a consequence of the success with which the Bureau of 
the Census has compiled a reliable body of manufacturing statistics. Professor Hanna’s 
book shows that this success has required much more than painstaking care: it has 
necessitated numerous decisions on complex conceptual problems. Some of these de- 
cisions, such as the choice of definition for an industry or a plant, have influenced 
in important ways the data developed. For example, prior to 1939 a plant to be con- 
sidered a manufactory had to have an annual gross value of output of at least $5,000. 
Subsequent to 1939, the requirement has been that it have at least one employee. 
This has had the effect of significantly reducing the number of plants shown. Profes- 
sor Hanna discusses a number of these definitional and conceptual problems and in- 
dicates the reasons for the choices made. He also describes the content of the Censuses 
of Manufactures, the Annual Surveys, and other periodic surveys. Finally, he devotes 
several chapters to the more practical aspects of data collection, such as developing 
a mailing-list for questionnaires, distributing the questionnaire, checking and tabu- 
lating the data, and deciding the most suitable form of publication. 

To users of economic data, the first five chapters of the book are the most relevant. 
In these chapters Professor Hanna discusses the various classes of manufacturing data 
which are collected and the industry and product classification systems used. In its 
selection of measures of output, in the choice of classes of inputs for measurement, 
and in the product breakdowns adopted, the Census Bureau has largely chosen to 
produce “general purpose” statistics as distinct from those designed to serve spe- 
cialized uses. This course offers greatest assurance against rapid obsolescence in the 
usefulness of the data produced as well as serving a multiplicity of needs, though none 
of them perfectly. What constitutes a significant body of general purpose statistics 
is, however, not always clear. The Census Bureau has adopted three criteria to assist 
it in choosing from among possible alternatives. First, the volume of output or in- 
put that is measured must be fairly large. Second, instability in the volume of a given 
output or input increases the importance of collecting data for it. Third is the 
importance of data from the standpoint of information requirements for defense 
mobilization. All three are, of course, valid considerations, but a number of others 
could readily be suggested. For example, the number of employees engaged in research 
is likely to be smail relative to total manufacturing employment but their stra- 
tegic role would seem to warrant separating them, in the data compiled, from other 
central office employees. Similarly, considerable interest attaches to the proportion 
of central office employment or payrolls represented by managerial as distinct from 
clerical personnel. 

In deciding what data to develop, the Census Bureau secures advice from several 
committees and, in addition, circulates its proposed questionnaires for suggestions 
among companies, trade associations labor unions, research organizations and uni- 
versity scholars, and others. A rather striking fact is that in 1954 of 3,755 such re- 
quests for suggestions only 62 went to research organizations and universities. While 
the weight given to advice cannot always be measured by the number of persons 
offering it, the latter group seems clearly under-represented. 

The reader of Professor Hanna’s book gains the impression that the Census Bureau 
takes great pains, before publishing data, to assure that the coverage of reports for a 
given industry is adequate and that duplication of data, for example as a result of 
inter-plant transfers, is not excessive. The Bureau also carefully ascertains if firms 
maintain records for a given statistic before requesting the information in a ques- 
tionnaire. On the other hand, it appears that far less emphasis is given to ascertaining 
if the underlying records which firms maintain are meaningful. For instance, in its 
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decision in connection with the 1958 Census to collect information on the value of 
plant and equipment in manufacturing by state and by industry, the Bureau as- 
certained that the relevant records are generally maintained by companies on a 
plant basis. However, no systematic study appears to have been made of the extent 
to which depreciation practices differ among plants and the effect of these differences 
(not to mention the effect of changes in construction costs and hence the prices at 
which plants were acquired) on the meaningfulness of the data. Tests of validity of 
the underlying records of firms are frequently difficult to conduct. On the other hand, 
numerous internal checks are frequently easy to execute on the basis of the data 
that are or could be collected. As one example, information on the difference between 
production and shipments could readily be compared with data on changes in in- 
ventories. 

No problem is more crucial to the compilation of manufacturing statistics than the 
use of an appropriate system of industrial classification. However well the classifica- 
tion system adopted may accord with theoretical criteria, it did not arise from the 
scrupulous application of economic theory. The author suggests that, “Perhaps 
industrial classification started as an ad hoc construction and developed through a 
series of ad hoc decisions. ...” At present, “ ... the classification of establishments 
is, with few exceptions, convertible to a list of manufactured products.” That is, only 
occasionally are other than product criteria (such as the nature of raw materials or of 
production processes) used to determine industry boundaries. The theorist, however, 
may be somewhat dissatisfied with the clarity of “product criteria,” for distinctions 
between products, as contrasted with minor variations of the same product, them- 
selves require definition. 

The reader of Professor Hanna’s study who expects much new information on the 
comparative reliability of various measures of output and input is likely to be dis- 
appointed. The book contains relatively little statistical information on the validity 
of various classes of Census data. For example, the author indicates that the con- 
tribution of several industrial sectors is not subtracted, as it needs to be, in the com- 
putation of value added in manufacturing. Yet there is no information on the mag- 
nitude of the problem. There is also no information on the industries in which various 
measures of input or output (e.g. payrolls and employment or shipments and value 
added) give similar or divergent indications of change over time. This is not meant 
as criticism of the author, for Professor Hanna could hardly have conducted all the 
necessary studies himself. It does point, however, to a need for additional studies of 
the characteristics of Census data. 

Some of the problems which characterize the records that firms maintain are also 
intriguing from the standpoint of the student of business decision-making. One of 
the difficulties associated with data on inter-plant transfers (that is, shipments from 
one plant to another of the same company) is the problem of selecting an appropriate 
price at which to value the shipments. The way in which companies normally value 
such shipments also throws light on one aspect of business decisions. When inter- 
plant transfers are occasioned by consumption within a company of part of its own 
output, measures of the profitability of the company’s separate divisions will depend 
upon the method of valuing inter-plant shipments. These measures of profitability 
can be expected, in turn, to affect decisions on output and investment. 

Roughly half of Professor Hanna’s book is devoted to the techniques adopted by 
the Census Bureau in collecting, processing, and publishing data. There is one chapter 
on developing a sample and a directory of companies to which questionnaires are to 
be sent. Another chapter deals with the problems of collecting returns. Two chapters 
are devoted to the processing of réturns and the tabulation of data. A section of the 
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book is also concerned with the handling of confidential information. The Census 
Bureau acquired its wisdom in these matters over a number of decades, and the 
author’s careful description of its practices should be invaluable to less experienced 
organizations contemplating the conduct of surveys. 


Cahiers du Séminaire d’Econométrie: No. 5, Production, investissements et productivité. 
René Roy, Editor. Paris: Centre National de la Recherche Scientifique, 1959. Pp. 178. 1,200 
Fr. francs. 


Ronatp I. McKinnon, Syracuse University 


His collection of eight papers, each with an introduction by M. Roy, covers top- 
Ties over a wide range of economic and statistical interest. 

The first article, “The Idea of a Production Function” by Henri Lavaill, is in the 
former category. The notion of a global production function as a set of most efficient 
points of component preduction techniques is derived. He distinguishes the case 
where any particular technique may be used a multiplicity of times in conjunction 
with others, from that where different possible techniques are mutually exclusive 
and may be used only once. This approach certainly leads to clarification of the mean- 
ing of a production function, although the early notation and diagrams are not always 
clearly explained. 

The article “Capital Seen as an Economic Variable” by M. T. Barna is a good out- 
line of the difficulties and pitfalls associated with the notion of a homogenous capital 
stock and production functions based thereon. In particular, Barna discusses the 
statistical and economic problems of valuation and quantification by volume or 
size. He suggests that the two methods of valuation most commonly used, book value 
and the market value of outstanding securities, are totally inadequate; the first be- 
cause it is based on original cost which is only imperfectly corrected and the second 
because it depends too much on the managerial skill of the particular individuals 
controlling the assets and on special market conditions. Instead, one should rely on 
expert assessment of the type used by insurance companies and taxation authorities. 
These professional assessors evaluate on the basis of replacement cost corrected for 
wear and technical change. Such information is difficult to assemble, but English 
data of this type has been used by Barna and gives startlingly different results from 
those obtained by more conventional methods. He examines the fallacies involved in 
using production functions not corrected for technical change and in fitting a single 
production function to several different industries. Although the article is quite 
negative in its treatment of established procedures, it is very worthwhile reading for 
statisticians interested in measuring capital formation. 

“Searching for a Criterion of Desirability for Important Investments” by Jacques 
Lesourne is an investigation of the relative merits of two common criteria for judging 
the desirability of an investment: (1) use marginal cost pricing and carry out the in- 
vestment if net revenue is positive, and (2) carry out the investment if it increases 
consumer surplus in the economy. (2) is strongly favored and a highly abstracted 
general equilibrium analysis is offered in support of it, but there are no feasible guide- 
lines for implementation of any of the results. 

“Production Cost Curves in Theory and in Practice” by Erich Schneider is a short 
note on the applicability of the traditionally shaped production cost curve, derived 
from agricultural considerations, to industrial uses. After looking at factors with 
widely different cost effects, he concludes that, a priori, no unique form of the cost 
function can be assumed in advance. 


“Reflections on Productivity Indices” by M. Marche is a good description from a 
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theoretical and practical point of view of the statistical problems which must be 
faced in choosing output, input, and productivity indices for diverse commodities. 
Marche considers different situations over time involving changes in prices and out- 
puts where the standard Laspeyres and Paasche indices are alternatively optimal. 
The properties of the continuous Divisia index are discussed and the conditions under 
which the value of the index is independent of the expansion path are given. Stress is 
placed on choosing appropriate indices for making particular type of comparisons; 
for example, those on the international level. Disparities arising in indices of value 
added and gross output are examined. From the viewpoint of presentation, the re- 
viewer feels those invariance qualities of indices which are intuitively “desirable” 
should have been listed as a complete set at the beginning and the ensuing discussion 
for each situation and index should have referred back to this set. As it is, they are 
introduced piecemeal and not considered completely for each case. Notwithstanding, 
the article both summarizes and contributes to the theory in this area and is highly 
informative. 

“Reflections on the Link Between Salary and Productivity” by M. R. Rémery is a 
short note in which the following two points are made: (1) that productivity is more 
closely affected by the way remuneration is related to output rather than the absolute 
level of remuneration, and (2) outdated formulae for relating work to remuneration 
can have very serious output effects. 

The last two articles, “Methods of Measuring Productivity and Setting Prices on 
Transport Routes” by Jacques Durand and “Measures of Productivity in Air 
Transport” by Gérard Pilé are not of general theoretical interest but the article of 
Pilé’s contains extensive well compiled data on inter-aircraft and inter-country per- 
formances in the air transport industry. 

Much of the writing in this collection of articles is not always up-to-date and, at 
times, not overly concise. But in overall quality and statistical interest, the articles 
of Barna and Marche must be singled out as being well worthwhile reading for statis- 
ticians working in these particular areas. 


Allocation in Space: Production, Transport and Industrial Location. L. Lefeber. Amster- 
dam: North-Holland Publishing Company, 1958. Pp. xv, 151. $4.50. 


James N. Boues, University of California, Berkeley 


OURTEENTH of a series entitled “Contributions to Economic Analysis,” this book 
F is primarily concerned with the application of linear and nonlinear programming 
methods to develop a general equilibrium theory of production, transportation, and 
industrial location under the assumption of pure competition. The author is aware 
of “the inherently monopolistic features of spatial organization” but aims “to provide 
a theory of optimal spatial allocation of factors and distribution of final goods” so 
that “the extent and significance of monopolistic deviation can be appraised.” 

Most applications of programming methods to economic analysis can be classified 
as either emphasizing the translation or development of economic theory in terms 
compatible with the mathematical requirements of the method or as emphasizing 
the application of the method to substantive research problems. This book clearly 
belongs to the first class. This is not to say that it would not be of interest to a prac- 
titioner, only that minimal reference is made to actual or potential use in applied 
research. 

Following an introductory chapter containing a brief discussion of the works of 
selected location theorists, the book is divided into three successively more com- 
prehensive and complex parts, the first dealing with the spatial allocation of produc- 
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tive factors, the second with the spatial equilibrium of production and distribution, 
and the third with the choice of industrial location. Finally, in the Appendix, largely 
mathematical, the preceding analysis is enlarged to the point, that the adjectives 
“general equilibrium” are in fact appropriate. Perhaps even here it is not completely 
appropriate since only the first two of the three aspects of production, changes in 
form, location and time, are treated. 

Throughout the text, the simplest possible models are used to develop and illus- 
trate the corresponding theory. For example, the model for Part I includes only two 
locations in each of which are located specified quantities of each of two factors of 
production and in each of which a single final good is produced. The cost of distribu- 
tion of the final goods is assumed to be zero and their prices are assumed to be given. 
The factors can be transported by transportation activities that use factors drawn 
from either location. The problem, then, is to maximize the value of production by 
selecting the combination and level of production and transportation activities sub- 
ject to the appropriate restrictions. It would appear to be pedagogically desirable to 
consider first the completely linear case using linear programming methods and then 
to consider the case where the feasible convex region has nonlinear boundaries. 
Actually, much of chapters two through four are concerned with what the author 
describes as “common sense” derivation of efficient nonlinear transformation rela- 
tions between the two goods produced by considering in detail a series of special 
cases differentiated by alternative conditions on the transportation of factors from 
one location to the other. The reader would be well advised to skim this section and 
concentrate on the remaining chapters. 

The author assumes that the reader is reasonably familiar with linear programming 
theory and provides adequate citations to elementary textbooks for those readers who 
are not already prepared. 


Principles of Engineering Economy. Fourth Edition. Eugene L. Grant and W. Grant 
Treson. New York: The Ronald Press Company, 1960. Pp. 574. $8.00. 


8. B. Lrrraver, Columbia University 


His book is in its fourth edition and originally appeared in 1930 under the name of 

the senior author. It has done yoeman service as a college text for undergraduate 
engineering students and will continue to be of excellent service to those requiring 
acquaintance with cost determination on engineering and industrial projects. The 
expression “Engineering Economy” is appropriately used to designate the problem 
of minimizing engineering and industrial costs and not to imply an exposition on 
on economics. 

The text is divided into four parts. In the first, there is some general discussion on 
choosing between alternatives. In the second part, a hundred pages are devoted to 
conventional interest, present worth and rate of return calculations. This material 
is thoroughly and clearly presented for its purposes. Certain mathematical derivations 
are reserved for the appendices. In the third part, some three hundred pages are 
devoted to a variety of cost studies dealing with depreciation accounting, increment 
and sunk costs, the effects of income tax considerations and assets retirement and 
replacement theory, among other topics. Appendices on various interest formulae 
and replacement models, interest tables and references comprise the fourth part. 

This material is entirely classical and some of it might be developed from a broader 
and more modern perspective for persons of greater sophistication and adequate 
background in mathematics, probability and statistical inference, and modern 
methods of decision making. The essential point of view of this work is that of find- 








204 AMERICAN STATISTICAL ASSOCIATION JOURNAL, MARCH 1961 


ing a minimum cost from among those of a number of reasonable alternatives. The 
techniques for these cost calculations and the kinds of alternatives to consider are 
fully and simply presented. Example and illustration abound and are well used. 
Most engineers fortified with this material will find use for it and teachers of the sub- 
ject will find this book very helpful. 

One cannot help observing, however, that this text runs the hazard, as do so many 
elementary engineering college texts, of being incomplete when avoiding probabilistic 
and statistical aspects of their subject matter or of providing inadequate and possibly 
misleading treatment because the reader is assumed not to possess the necessary back- 
ground. There may be a good case for intensifying the study of mathematics, prob- 
ability and statistical inference in the first two years of engineering and science courses. 


City Expenditures in the United States. Harvey E. Brazer. New York: National Bureau of 
Economic Research, 1959. Pp. xi, 82. $1.50. Paper. 


Howarp G. Scuatuer, Tulane University 


HIS is a study of the differences in the per capita expenditures in 1951 of 462 of 

the 481 (data were not available for 19) cities whose population exceeded 35,000 
in 1950 and the per capita expenditures in 1953 of the 40 largest cities in 1950. The 
study deals with general operating expenditure only. Capital outlays, insurance and 
utility accounts, and debt repayment are excluded. Brazer first examines the varia- 
tion in city expenditure and then analyzes the factors associated with this variation. 

In the first section, Brazer establishes that the variation in per capita expenditure 
for the 462 cities is high for both total and the individual functional categories and 
that the same factor or factors are not responsible for the variation under the in- 
dividual functional categories. He further establishes that the state of location in- 
fluences the per capita expenditure patterns. The variation of per capita expenditure 
levels is high within states but the variance between states is significantly greater. 
The ratios between variances exceed the ninety-ninth percentile value of the F dis- 
tribution for total expenditure and all the functional categories. An examination of 
mean expenditure levels grouped by census regions, shows that forces influencing 
expenditure patterns extend beyond state boundaries. 

The second section of the study has four major parts: (1) an examination of the 
relationships between six economic variables and the per capita expenditures of the 
462 cities, (2) an analysis of the relationship between the six variables and the cities 
in California, Massachusetts, and Ohio, (3) an examination of the association be- 
tween eight variables and the per capita expenditures in 1951 of the forty largest 
cities, and (4) an analysis of the variation in the per capita expenditures of the 462 
cities grouped by type of city. 

For the first two parts, multiple regression techniques are used. California, Massa- 
chusetts, and Ohio are considered separately since the variation in expenditure levels 
within these states is considerably lower than for all 462 cities. The six independent 
variables selected by Brazer are (1) population size, (2) population density, (3) rate 
of population growth 1940-1950, (4) median family income in 1949, (5) per capita 
intergovernmental revenue, and (6) employment per 100 population in manufactur- 
ing and trades and services. 

Multiple regression techniques are also used for the analysis of the forty largest 
cities. However, three independent variables are added—ratio of the population of 
the central city to population of the standard metropolitan area, number of children 
in the public schools per 1,000 population, per capita amounts spent directly on wel- 
fare by states. Population size is dropped from the analysis. Education and welfare 
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expenditures are considered in the forty-cities analysis whereas they were not in the 
previous analyses. 

In the last section, the 462 cities are grouped according to the following classifica- 
tions: (1) core city of major metropolitan area, (2) core city of minor metropolitan 
area, (3) high-income residential suburb, (4) low-income residential suburb, (5) in- 
dusirial suburb, (6) independent city, and (7) major resort area. Brazer compares 
variance between groups and within groups and in all categories the ratios exceed 
2.8, the Fo.99 value. 

Broadly stated, the major conclusions which Brazer draws from his analysis of the 
factors associated with the variation in city expenditures are as follows: (1) popula- 
tion size is of minor, if any, importance as a determinant, (2) population density and 
the ratio of the city’s population to metropolitan area population are both important, 
(3) per capita intergovernmental revenue and median family income are positively 
associated with per capita expenditure, (4) the employment ratio is of minor im- 
portance, (5) the ratio of children in the public schools per 1,000 population is signif- 
icant for the large cities, (6) all independent variables combined have considerable in- 
fluence but still leave a large amount of unexplained variation, and (7) the classi- 
fication of cities indicates that the type of city is important in explaining the varia- 
tion in per capita expenditure. As Brazer suggests (p. 68) “there is no facile means of 
explaining the tremendous range of differences in the levels of city expenditures.” 

It is this writer’s opinion that Brazer’s major contributions lie in his careful assess- 
ment of the influence of population variables and his establishment of the importance 
of the type of city and its region and state of location. His handling of population 
variables represents a substantial improvement over previous efforts (cited pp. 13-6). 

Finally, Brazer’s study points up the paucity of our knowledge cbout the produc- 
tion of goods and services by city governments. (Brazer, himself, emphasizes the 
need for further research, p. 68.) We do not, for example, have any information about 
the nature of the production functions involved, the relationship between factor 
prices paid and the size of the governmental unit involved, or consumers’ preferences 
for the various kinds of public goods and for public versus private goods. Perhaps 
case studies of extreme observations, as Brazer suggests (p. 68) would begin to yield 
some of the necessary information. 


Social Accounting and Economic Models. Richard Stone and Giovanna Croft-Murray. 
London, England: Bowes and Bowes, 1959. Pp. 88. 9s 6d net. 


H. S. Hourmaxker, Harvard University 


_— aim of this small volume is to show how the estimation of national income 
and related concepts “leads inevitably to the construction of a complete system of 
social accounts,” and from there to economic model-building to make the system 
more suitable for analysis and forecasting. 

In Chapter I a simple accounting framework is described and illustrated with 
British data. The next chapter introduces input-output analysis not, as is usually 
done, by technological considerations, but through a presentation of the national 
accounts in the form of a transactions matrix. 

In Chapter III the authors proceed to model-building by introducing technological 
and behavioral relations. There is some discussion of a dynamic input-output model 
in which inventories and output are subject to continuous adjustment depending on 
the difference between actual and desired levels. Macroeconomic models of the 
Tinbergen-Klein type are barely mentioned. 

The most interesting part of the book is Chapter IV, devoted to a summary of 
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recent work (much of it already published in the journals) by Stone and his associates 
on consumer demand. An example concerning the influence of supply factors on esti- 
mates of demand elasticities, intended to justify the use of single-equation methods, 
is too special to be convincing. There is a glowing account of the lognormal Engel 
curve due to Aitchison and Brown; unfortunately it fails to mention the computa- 
tional obstacles associated with this curve, and refers only obliquely to the difficulty 
of applying the Aitchison-Brown approach to time series, and indeed of reconciling 
it with theoretical notions of consumer’s choice. Dynamic demand equations, based 
on the adjustment principle, are treated in some detail, again with empirical illus- 
trations. One may question whether the authors are correct in treating perishable 
goods as durable goods with a lifetime of one year. A lifetime of zero years would seem 
more appropriate; if this makes nonsense of some of their formulae, so much the 
worse for the underlying theory. 

Another new development to which the reader is introduced is the linear expendi- 
ture system, in which all commodities are considered simultaneously on the basis of 
a very simple set of demand equations, subsequently made more flexible by combina- 
tion with dynamic elements. The resulting estimates of short-term and long-term 
elasticities deserve close study. The authors recognize that the various approaches 
presented in this chapter remain to be integrated, but they seem to be unduly optimis- 
tic on the prospects for doing so. Thus the assumptions of the linear expenditure 
system flatly contradict the findings on Engel curves whether of the lognormal or of a 
more conventional form. 

This little book is packed with information and its format is a welcome change 
from the heavy tomes to which American publishers are addicted. The often extreme 
conciseness of the writing, however, restricts its appeal to the specialist. This is 


regrettable because the econometric work with which it deals merits wider attention 
than it has so far received. 


Automatic Data-Processing Systems: Principles and Procedures. R. H. Gregory and 
R. L. Van Horn. San Francisco: Wadsworth Publishing Company, 1960. Pp. 705. $11.65. 


Danreu Tercuroew, Stanford University 


“wn 1956 the Army Ordnance Corps initiated an automatic data-processing systems 

(ADPS) career program for education of top management, training in needed 
skills, and career development. One urgent requirement in this program was a com- 
prehensive textbook; to fill that need, this ADPS textbook was developed and written 
under contract with the Massachusetts Institute of Technology.” 

The book does not deal with the processing of data in the statistical sense; it does 
not discuss the use of computers for the analysis of data from surveys or experiments. 
Instead, it deals with the processing of data as the term is now used in business, and 
covers the use of computers for record-keeping, maintenance of files, and preparation 
of reports and documents, etc. 

In its area it is a pioneering work that certainly fills the need for a textbook in 
business data processing courses. It is at present the best source available on the 
methodology of using computers in business firms for data processing. No previous 
knowledge of electronic computing equipment or business data problems is re- 
quired. The book can be read by management, but is of most use to persons who will 
be involved in day-to-day operations. 


Tables of Tolerance-Limit Factors for Normal Distributions. Alfred Weissberg and Glenn 
H. Beatty. Columbus, Ohio: Batelle Memorial Institute, 1955. Pp. 43. (Available without 
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charge while supply lasts from Batelle Publications Office, 505 King Avenue, Columbus 
1, Ohio.) 


Fr" the authors’ introductory remarks: 

“Tables of factors for use in computing two-sided tolerance limits are presented. 
In contrast to previous tabulations of the tolerance-limit factor K, we tabulate the 
factors r(N,P) and u(f, y) whose product is equal to K. This results in greatly in- 
creased compactness and flexibility. The mathematical development is discussed, 
including methods used to compute the tabulated values and a study of the accuracy 
of the basic approximation. A number of possible applications are discussed and 
examples given.” W.G.M. 


Statistics of Income... 1957-58, Corporation Income Tax Returns with Accounting 
Periods Ended July 1957-June 1958. U. S. Treasury Department, Internal Revenue Serv- 
ice. Washington, D. C.: Government Printing Office, 1960. Pp. 212. $1.50. 


DanreL M. Houuanp, Massachusetts Institute of Technology 


tatistics of Income: Corporation Income Tax Returns has been issued annually, with 
4J slight variations in title, close to 40 years. And over this period “users” have come 
to expect a well-defined (but nonetheless rich) statistical offering. Why, then, review 
the latest in a long line of annual volumes whose basic character is so well-known to 
researchers? Because an old friend can no longer be taken for granted. 

In recent years, much thought and energy have been expended by the Internal 
Revenue Service in improving the data and enlarging the scope of the tabulations in 
Statistics of Income. The 1957 volume (for tax returns with accounting periods ended 
July 1957-June 1958) contains some of the fruits of this effort. In addition to the 
basic set of data we had come to expect in previous years, we find a number of new 
tabulations, which because they aid in interpreting the data of the rest of the volume 
(as well as many other statistics), give evidence of an heightened interest in the 
analytical uses to which Statistics of Income may be put. Actually the provision of 
supplements and complements to the basic statistical diet has been going on for sev- 
eral years. In this latest volume the quantity of such material is particularly note- 
worthy, and it is useful to cite and comment briefly on the new data this report 
contains: 

1. In Table 23, “Depreciation Claimed, by Method, by Major Industrial Group,” 
there appears for corporations in 48 industrial groups (which exhaust the corporate 
universe) the amount of depreciation claimed on tax returns based on the straight 
line method and on what are called “other” methods (primarily declining balance 
and sum-of-the-years digits). That this is useful information is a point that need not 
be labored. For one thing it will be very helpful in interpreting the movements of 
corporate profits, particularly any comparisons involving pre and post 1954 data. 

All active corporations claimed $17.1 billions of depreciation—$12 billion (about 
70%) on a straight-line basis, $5.1 billion on other bases. From Table E on Page 4 
of the introductory discussion it appears that in this latter 30 per cent declining 
balance accounted for 16 percentage points, SYD for 11, and miscellaneous methods 
for the minor figure of 3. In all 781,000 corporations claimed a depreciation allowance, 
757,000 used the straight line method, and 160,000 used other methods. (As would 
be expected very few firms used “other methods” solely.) The data of Table 23 show 
some interesting variation by industry. In manufacturing over 35 per cent of all 
depreciation was based on methods other than straight-line; in the public utility sec- 
tor the comparable percentage was only 23. Even greater variation, of course, would 
show up in the finer industry detail of the table. Table E, noted above, points up the 
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rapid rise in the importance of the accelerated methods permitted in the Internal 
Revenue Code of 1954. Straight line depreciation fell from 89 per cent of the total 
in 1954-55 to 70 per cent by 1957-58. Over the same period declining balance de- 
preciation rose from 5 to 16 per cent, and sum-of-the-years-digits from 2 to 11 per 
cent. 
2. Table 25 presents a frequency count for 77 industry groups of selected balance 
sheet and income statement items. Now for the first time we know the number of 
corporations reporting a particular category of receipt, expenditure, asset, or liabil- 
ity. This clearly helps in analyzing the dollar amounts reported in these categories. 
For example, Table 3’s information that close to 10 per cent of all inventories in the 
manufacturing sector were reported as on a LIFO basis (with “Other than LIFO” 
representing 40 per cent, and the basis not stated for 50 per cent of the inventories) 
is complemented by the data of Table 25, viz., that less than 1 per cent of manufac- 
turing corporations reported inventories on a LIFO basis. 

3. Thanks to the computational revolution, we have in Table 24, spelled out in 
all its gory detail—“Number of Returns and Total Assets, by Ratio of Business 
Receipts to Total Assets, by Per cent of Net Income on Business Receipts, Size of 
Total Assets, by Industrial Division.” Given an industrial group (of which there are 
9 in the table), we have a set of five tables or, more properly, sub-tables (one for each 
of the asset-size classes used in Table 24) in which the rows are percentage return on 
business receipts, and the columns are the ratio of business receipts to total assets. 
Thus we are presented with a cross-classification of an operating ratio and a financial 
ratio; we can derive in addition the range in the rate of return on total assets as the 
product of these two ratios. The analytical uses of the data of this table are numerous. 
Some examples as well as some useful cautions in interpreting the statistics are given 
in Tables F, G, and H and the discussion on Page 5. (Parenthetically we may note an 
interesting Table on somewhat similar lines in Statistics of Income, 1955—Table 7 
which presents the following items: Number of Returns, Total Assets, Net Income, 
Income Tax, and Dividends, all for a cross-classification of net income amount and 
asset size groups.) 

The Internal Revenue Service plans to continue the computation and publication 
of financial ratios. Some of the ratios planned for the next few volumes are the 
following: 

a) Relationship between net profit and rate of inventory turn-over 

b) Relationship between net and gross profit 

c) Relationship between net profit in 1959, and 1957 and 1958 

4. Tables 26, 27 and 28 present, once again for the first time, data on changes in 
the income of identical corporate taxpayers. (In item I of Form 1120, taxpayers are 
asked to give their net income or deficit for the two preceding years.) Comparative 
net income figures are shown for 77 industry groups in Tables 26 and 27. In Table 26 
the comparison runs over the current year (1957-58) and 1956-57. In Table 27, net 
income for 1957-58 is compared with the net income of the same corporate taxpayers 
in 1956-57 and 1955-56. Finally, in Table 28 there appears a frequency count of 
firms (classified by 1957-58 asset size) which were in the same income or deficit class 
in both 1957-58 and 1956-57. 

5. At least as important as the new data now incorporated in Statistics of Income 
is the increased speed with which the annual] tabulations are being made available. 
The 1948 issue was ready for submission to the Secretary of the Treasury in Septem- 
ber, 1953, some 50 months after the last returns of the period were filed. By 1952 this 
had been reduced to 28 months, and the present issue was made ready in 21 months, 

In summary, significant improvements and expansion of scope have characterized 
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Statistics of Income in recent years. There is every prospect that in the next few years 
this will continue. The Internal Revenue Service clearly deserves our thanks. 


Toward a Geography of Price. William Warniz. Philadelphia, Pennsylvania: University 
of Pennsylvania Press, 1959. Pp. 117. $5.00. 


Grecory C. Cuow, Cornell University 


ERHAPS misinterpreting the title, the reviewer opened this book with enthusiasm, 
Peuedide to find another improvement on the existing models of price formation 
through space, or a better statistical implementation of them. He was disappointed 
as he moved along Chapter 1, intended to be “a brief historical survey of man’s 
considerations of the dimensions of society with special reference to economic ac- 
tivity. These dimensions have been considered to be mass, time and space.” Filing a 
complaint against the social scientists up to 1900 for their failure to “recognize the 
need for dimensional analysis” while the physical scientists were doing so, the author 
does acknowledge “the beginning of the development of a general theory of space 
economy” in the 20th century. The historical survey is sketchy. Ignored are the re- 
cent important contributions to price theory including spatial relationships, e.g., 
Enke (Econometrica, 1951), Samuelson (American Economic Review, 1952), Beck- 
mann (Econometrica, 1952), Fox (Econometrica, 1953), Fox and Taeuber (American 
Economic Review, 1955), Koopmans and Beckmann (Econometrica, 1957), and Lefeber 
(Allocation in Space, 1958). When the names of Lésch and Hoover are mentioned, we 
are told that “it is not the intent of this study to collect, analyze, criticize, and sum- 
marize the contributions to space theory.” 

An author is entitled to choose his topic (to be revealed shortly) and, let us say, 
he is free not to summarize previous contributions bearing upon it, but his work might 
suffer from not being built upon previous contributions. Now, at the end of Chapter 
1, the objective of this study is stated. It is “to investigate the geographical (i.e., 
spatial) distribution of farm prices of certain agricultural commodities.” 

Chapter 2, with the formidable heading, “Geography and Economics,” sets forth 
the hypothesis. To explain the price, at harvest date, of an agricultural commodity 
in each of the 48 states, three factors are selected. On the demand side is Gross 
Economic Population Potential at the center of a state, defined as the sum of income 
payments in all other states, weighted by the reciprocals of the distances from the 
centers of these states to the center of the state in question. On the supply side are 
Product Supply Space Potential and Product Supply Time Potential. The Space 
Potential at a state center is the sum of the quantities produced in all other states 
weighted again by the reciprocals of the distances. The Time Potential, in the middle 
of a month, is the sum of national productions in all other months weighted by the 
reciprocals of time intervals to the month in question. (National production in each 
of the other months contributes twice to the sum for Time Potential in a given month. 
For example, the Time Potential in March includes November production twice, the 
preceding November being four months away and the coming November eight months 
away from March.) Thus price at the harvest date in each state is to be explained by 
Gross Economic Population Potential for the state, Supply Space Potential for the 
state, and Supply Time Potential at the harvest date. All observations are averages 
for the ten years, 1940-49. 

As a first approximation, the hypothesis seems reasonable. No doubt, forces of 
demand and supply originating from other states will influence the price of a product 
in a given state, as long as the product can be transported. Also, the farther away the 
state, the weaker will be the influence of its demand and supply conditions, mainly 
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because of the higher transportation costs. However, the approximation is crude 
indeed, especially in view of the existence of a more comprehensive theory in econom- 
ics dealing with exactly the same problem. Given the demand and supply functions 
for a commodity in each region, and given the unit transportation cost from any 
region to any other, the problem is to determine the equilibrium prices in all regions, 
as well as all shipments and the quantities demanded and supplied in all regions. 
Enke (1951) solved this problem by electric circuit, and Samuelson (1952) did so 
using linear programming. Fox (1953) studied empirically the geographical distribu- 
tion of feed prices in the United States using this theoretical framework. This theory 
is more comprehensive than the author’s approach using different “potentials.” It 
explains exactly how the local demand and supply conditions in different regions, 
acting through the transportation network, simultaneously determine the prices in 
all regions. 

Chapter 3 is devoted to statistical methodology and the presentation of data. The 
commodities selected for study are wheat, potatoes, onions, and strawberries. Multi- 
ple linear regressigh of price on the three explanatory variables is proposed. Since the 
economic hypothesis is crude, the statistical analysis is concerned mainly with the 
sign of each of the regression coefficients in the multiple regression. All data are well 
presented. There are nice looking contour maps for Economic Population Potential 
and Supply Space Potential. However, sources of data are not always complete; 
terms such as income, production, and price are not clearly defined; possible dis- 
advantages of using ten-year averages are not discussed. 

The statistical results, summarized in Chapter 4, show significant (at 1% level) 
multiple correlation coefficients in all four cases, and correct signs for all three coef- 
ficients in each of the four regressions, but no significance tests are performed for the 
regression coefficients. The author further suggests the correlation between Economic 
Population Potential (measure of proximity to forces of demand) and Supply Space 
Potential (measure of proximity to sources of supply) observed at different locations 
as an indication of the degree of “market orientation” of the production of a given 
commodity. 

It is only fair for the reviewer to point out that, for the problem tackled inthis 
book, a more comprehensive theory exists which has already been implemented by 
empirical studies, and that we know more about price equilibrium in space and loca- 
tion theory (cf. references above) than this book indicates. Nevertheless, for a fresh 
approach using potentials to summarize crudely the forces of demand and supply, 
for confronting the simple hypothesis with data, for illustrating, once more, that 
crude economic hypothesis and elementary statistical techniques are still useful, 
and for the extremely interesting data presented, this book is recommended to the 
readers of JASA. 


Elementary Mathematics of Linear Programming and Game Theory. Z. G. Bennion. 
East Lansing, Michigan: Bureau of Business and Economic Research, Michigan State 
University, 1960. Pp. xvi, 140. Price unlisted. 


FRANKLIN R. Suupp, University of Illinois 


gw potential readership of this book is that limited but by no means empty set of 
businessmen and economists who although mathematically unsophisticated are 
mathematically curious (particularly about linear programming and the theory of 
games). The flood of recent publications on these same subjects addressed to this 
same audience strongly attests to its (the audience’s) substantial size. This is not an 
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implied criticism of Mr. Bennion for his role in contributing to any possible redun- 
dancy in the area; indeed at the time the manuscript was begun a very serious void 
did exist, and this volume represents an interesting, competent, and responsible at- 
tempt to close that gap. 

The author has wisely chosen to confine his discussion to strictly linear program- 
ming, the simplex algorithm and the zero-sum two-person game. One, however, feels 
cheated by the absence of a single reference to nonlinear programming or to the special 
algorithms for transportation problems, etc. Likewise the belated (last few pages of the 
Epilogue) mention of nonzero-sum and/or n-person games and games against nature 
is a bit bewildering. Furthermore the paragraph devoted to games against nature is 
more confusing than enlightening and results in an erroneous impression of the im- 
portance of these games in the decision making process. 

These criticisms are incidental when compared with the real accomplishments of 
the volume which include among other things the chapters entitled Degeneracy, 
Duality, and Game Theory and also the concise, lucid, and thoroughly enjoyable 
account of the Walras-Cassel general equilibrium model. The skillful illustration of 
the possible dangers of cycling inherent in a degenerate system and the clear ex- 
position of the Charnes technique for coping with this contingency deserve special 
note. Similar recognition is demanded of the detailed and calculated restatement of 
von Neumann’s proof of the minimax theorem, in which the author carefully guides 
the mathematical neophyte over a cairn of convex hulls and supporting hyperplanes. 

The chapter on Duality is also excellent although a bit involved. It is the reviewer’s 
opinion, however, that more generous use of the author’s economic knowledge would 
have made the exposition more palatable and rewarding to the intended audience. 
Certainly, the remarkable relationship between the primal and dual programs appears 
somewhat less remarkable (in fact, intuitively quite acceptable) if the dual variables 
are interpreted as imputed costs or accounting prices, given a profit maximizing in- 
terpretation to the primal program. 

If there is a major criticism of this small volume, it must be attributed to some 
conscious or unconscious indecision as to the book’s audience. The ostensibie reader- 
ship is obvious as the author frequently reiterates that he is writing for the mathe- 
matically unsophisticated. However, there are not infrequent occasions when the 
author appears to be addressing a somewhat more mature (mathematically) reader, 
as for example when he writes: 


“For the mathematically unsophisticated linear programming is more readily under- 
standable if the mathematical generalizations are preceded by relatively simple exam- 
ples.” 


The hazards of simultaneously writing for two audiences are obvious, and certain 
consequences are inevitable if this policy is to be pursued as it is here. If instead Mr. 
Bennion had focused his attention on the neophyte exclusively, the following changes 
would have been a natural concomitant and the usefulness of the book enhanced. 
The changes: 


i. A few completed business and economic examples of programming problems. (Only 
the pedagogical use of these illustrations is considered here, as the subsidiary benefit of 
demonstrating possible applications is outside the intended scope of this volume.) 

ii. Less frequent transitions from specific formulations of a problem to its general 
form and back again. (The elimination of these interruptions would fac"itate considerably 
the following of the argument being presented.) 

iii. A somewhat slower pace when introducing new concepts. (This would cushion 
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the shock of e.g., the following lines which appear just two pages after the definition of a 
matrix.) 


4, + ys = 5 — (3ys + Oy‘) 
2y1 + 3y2 = 6 — (Oys + 44) 
and the solution for y; and yz would be given by 


Ce) = 4c La) - iL» + Beda 
Y2 be 23 a2 

Without trying to gloss over the criticisms noted, it is only fair to conclude that 
this book is not only an interestingly and competently written volume, but it is also 


a very useful introduction to the mathematics of linear programming and game 
theory. This is a good book. 


The Role of Middleman Transactions in World Trade. Robert M. Lichtenberg. Occasional 
Paper 64. New York: National Bureau of Economic Research, 1959. Pp. 86. $1.50. Paper. 


Joun B. Henpverson, Allegheny College 


HIS paper is one of a series associated with the National Bureau’s study of the 

Structure of World Trade and Payments. li examines, as far as statistical records 
will allow, several types of merchandise transactions involving intermediaries be- 
tween producing and consuming countries. These dealings are, clearly, a possible 
source of systematic divergences between the trade statistics of buying and selling 
countries, since the differences may arise, not from erroneous reporting, but from 
different systems of reporting. But to explain how middleman transactions are re- 
sponsible for these inconsistencies requires more detailed information than is avail- 
able from most countries. The conclusions of the paper are therefore tentative, in- 
complete, and not easily applicable to the improvement of measurement of net inter- 
area trade balances. The merit of the paper rests rather in the fact that its cautious 
analysis shows the very great difficulty of making progress in the accurate reporting 
of the network of world trade. It points also to the hazards of using merchandise 
accounts, which are based largely on customs records, as a guide to the flow of claims 
and funds between countries. 

One of Mr. Lichtenberg’s problems is basic to the construction of balance-of- 
payments accounts. Since these accounts are a record of economic transactions be- 
tween the:residents of the reporting country and residents of other countries, there 
may be difficulty in establishing, for some transactions, what is the country of resi- 
dence of the foreign party to the transaction. Also, in determining the actual time of 
a transfer, trade records are not usually helpful, since they report movements of 
commodities and seldom their exchange of ownership. But another of Mr. Lichten- 
berg’s difficulties stems from his adopting a rather wide definition of “middleman” 
activity, the farther limits of which reach well into the services of direct investment 
abroad. It is tempting, especially when the object is the improvement of international 
accounts, to define a middleman as a third-country resident acting as more than finan- 
cial agent for, or carrier of, goods between the country of production and the country 
of consumption. But Mr. Lichtenberg’s extension of the concept to include the ac- 
tivities of multinational producers results in only a few references in the text to the 
petroleum industry’s structure. 

In this, as in other aspects of the study, limitations of statistical data determine 
what can be done. In the first of Mr. Lichtenberg’s analyses, an estimate of the pro- 
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portion of total world imports supplied by middlemen, it is the existence of records of 
imports both by country of origin and by country of purchase that determines his 
selection of a very unrepresentative sample of countries. Six of the seven are European, 
and Germany, with the highest percentage of purchases from middlemen, weighs 
heavily in raising the average of these purchases in relation to total imports. It is not 
surprising that imports from middlemen are of greatest importance to those countries 
which are not metropolitan centers for raw-material-producing dependencies and 
whose imports of manufactures are a relatively small part of total imports. But Mr. 
Lichtenberg’s estimate of the world total of middleman trade in 1952 is derived from 
a careful calculation, using the seven-country sample as a guide to the importance of 
certain countries as the source of commodities entering into middleman trade. The 
result, that this trade accounted for about 13 per cent of total world exports in 1952, 
is probably as fair an estimate as is achievable. (It could be objected that some of Mr. 
Lichtenberg’s material has little bearing on the result. For example, his tabulation 
of United States imports of merchandise laden in countries other than the country of 
origin is a tribute to the strategic significance of the Scheldt estuary rather than a 
helpful guide to the scope of merchanting trade.) 

The later sections of Mr. Lichtenberg’s analysis are even less well served with 
statistical data. The analysis of middleman trade in the produce of overseas terri- 
tories is related only to British experience and results only in inconclusive statements 
about the degree of control in the hands of British-resident companies. The discussion 
of switch trade effected through middlemen under cheap exchange rates is, as might 
be expected, not well supplied with firm figures and reads like a chapter of history. 
The declining importance of entrepét and trans-shipment in the twentieth century 
leads the argument of the paper back to the question of offshore merchanting, with 


the attendant difficulty that few countries provide trade records on a payments basis. 

In conclusion, this paper is a review of evidence rather than a firm quantitative 
analysis of a single tractable problem. The modesty and the pessimism of Mr. 
Lichtenberg’s comments on international organizations’ proposals for the improve- 
ment of trade reports are all the more affecting by reason of his gallant effort to wrest 
useful results from those statistics that are obtainable. 


Matrices. William V. Parker and James C. Eaves. New York, New York: The Roland 
Press Company, 1960. Pp. viii, 195. $7.50. 


Ricuarp V. Evans, University of Michigan 


ccoRDING to the authors this book is “written for introductory college courses on 
the subject and can be used not only with students of mathematics but also with 
students in other fields.” The chapter titles are: 


. Review of Elementary Concepts 
. The Algebra of Matrices 

. Transformations 

. Bilinear Forms and the Rank of a Matrix 
. Systems of Linear Equations 

. Congruence of Square Matrices 

. Quadratic Forms 

. Linear Vector Space Over a Field 
. Determinants of Square Matrices 
. Matrices and Polynomials 

. Special Matrices 


-mOoOn our wn 


_— 
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These topics are certainly the most usual ones used to introduce the mathematics 
student to matrices and to motivate his study. With the exception of one short 
illustration, no mention is made of the uses of matrix concepts in analyzing physical 
or behavioral problems. The concentration on abstract ideas is true not only of the 
text but also of the exercises. Thus, the reviewer believes that only readers who are 
highly motivated to study matrices will be able to complete any large segment of 
the book. An instructor using it as a text must supply his own motivational material. 
Under such an arrangement the book can be quite useful. The style of the entire book 
is evidenced by the style in which most theorems are handled. Theorem statements 
usually summarize preceding discussions. This same lack of introductory material 
pointing the direction which will be followed is true of the text at all levels of or- 
ganization. Further, there are no summaries so useful to the reader in checking that 
he has understood the concepts of a section or chapter. The authors have used almost 
an iron hand in topic arrangement to produce a sequence of topics and avoid concur- 
rent developments. This avoids some student confusion but perhaps it also means 
that the student will fail to see some of the important interrelationships among the 
topics. It is the unity of the various concepts which is the distinguishing characteristic 
of modern algebra as opposed to what is taught as high school or college algebra. 
Usually the authors have been able to avoid awkward presentations. The most 
striking exception is the postponement of the introduction of the elementary row 
operations until chapter 5 when linear equations are considered. Thus, when discuss- 
ing transformations of linear forms in chapter 3, an elimination method for solving 
X=AY is introduced, but this is done in terms of elementary transformations and 
matrix multiplication without mention of the undoubtedly more familiar elementary 
row operation concepts. The introduction of, matrices is via linear forms. This and 
the idea of transformations of linear forms leads naturally to matrix multiplication. 
On the other hand matrix addition is introduced only after it has become necessary 
in multiplication of partitioned matrices. The discussion of vectors is extremely brief, 
18 pages, and it is most unfortunate that the concept of dual space is not mentioned. 
Thus, the concepts of basis and coordinate representation are left unrelated and, of 
course, the opportunity to give further meaning to the concept of transposition is 
missed. To an instructor in an elementary course who is prepared to use supple- 
mentary material and supply intuition and motivation, this book provides a useful 
record of theorems and proofs as well as a source of exercises. 


A Primer of Real Functions. Ralph P. Boas. The Carus Mathematical Monographs, No. 
13. New York: John Wiley and Sons, 1960. Pp. xi, 189. $4.00. 


Fritz Herzoe, Michigan State University 


oas’ monograph presents to the beginning student of the subject a good intro- 

duction into the concepts and methods usually presented in a course on Real 
Functions. The treatment is essentially the classical treatment of the subject, which 
in the reviewer’s opinion enhances the value of the book from the expository stand- 
point. Topological concepts are kept to a necessary minimum, while emphasis is put 
on those ideas which are at the basis of every rigorous treatment of real analysis. 

The book consists of two chapters, “Sets” and “Functions.” In the first chapter 
the author bases his exposition on a knowledge of the real number system and then 
proceeds from the assumption that a set of real numbers which is bounded above has 
a least upper bound. He then derives the usual notions, concerning point-sets in 
metrical spaces. The later sections of this chapter are devoted to such items as con- 
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vergence of infinite series, sets of the first and second category, and sets of measure zero. 

The second chapter deals with continuous functions and their principal properties, 
uniform convergence, the Dini derivates, monotonic functions, etc. 

One subject not treated in the book is the theory of integration, although integrals 
are used in many places, the theory of the Riemann integral apparently being assumed. 
It seems regrettable that this important part of the theory of real functions has been 
omitted, though the author tries to justify this omission in his preface. 

The exposition is usually clear and concise. More difficult parts of the book are 
printed in small type and can be skipped without harm. The book contains a good 
number of exercises, whose solutions are found at the end of the book. 

The reviewer feels that the inclusion in the book of several topics deserve recom- 
mendation. Examples of this kind include the universal chord theorem (Sec. 14), an 
explicit example of an area-filling curve (Sec. 17), the equation f(z+y) =f(x)+f(y) 
(Sec. 20). 

It might be doubtful whether, as the author claims, a student with a knowledge of 
calculus should be able to read this book. There is no doubt, however, that a slightly 
more advanced student (one who has had advanced calculus) will know a consider- 
able amount of good mathematics after studying this primer. 


Freight Transportation in the Soviet Union: A Comparison with the United States. Ernest 
W. Williams. National Bureau of Economic Research (Occasional Paper 65), 1959. Pp. 
ix, 38. $.75. Paper. 


yes National Bureau of Economic Research is engaged in a study of Soviet Eco- 
nomic Growth. One part of this study deals with transportation in the Soviet 
Union. 

“The present paper attempts a comparison of freight transportation output in the 
Soviet Union with that in the United States in quantitative terms, a comparison of 
the contribution to that output of the several forms of transportation which serve in 
each nation, and some comment upon the reliability of the data which are available 
for the two countries. Considerable inquiry has been made into the means by which 
the Soviet output is achieved in order to permit a judgment of the reliability of the 
traffic statistics. For that purpose a large number of series concerning motive power, 
rolling stock, and operating statistics was developed. These data and the analysis 
based upon them are contained in the working draft, ‘Transportation in the Soviet 
Union’ which is later to be published, and are only referred to here when directly 
pertinent to the discussion of freight traffic.” (From the preface to Occasional Paper 
65.) W.G.M. 


Statistical Abstract of the United States: 1960. U.S. Department of Commerce, Bureau of 
the Census (Eighty-first Edition). Washington, D. C.: U. 8S. Government Printing Office, 
1960. Pp. xii, 1040, $3.50. 


stpE from bringing up-to-date over 1200 tables and charts, this eighty-first annual 
issue of the Statistical Abstract contains new tables on principal financial flows 
under the flow of funds concept. A number of new tables has also been added in the 
agriculture and in the social insurance and welfare section. In addition, an appendix 
links pertinent date in the Abstract with the series in the new edition of Historical 
Statistics of the United States. The Abstract also contains revisions of various series 
based on the 1958 Censuses of Business, Manufacturing, and Mineral Industries, but 
does not contain statistics from the 1959 Census of Agriculture or the 1960 Censuses. 
R.F. 
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Basic Readings in Social Security. U. S. Department of Health, Education and Welfare, 
Social Security Administration. Washington: U. 8. Government Printing Office, 1960. Pp. 
vi, 221. $1.00. Paper. 


S Basic Bibliography in Social Security” would be a more appropriate title for 

this list of references in the field of social security; it is not a book of readings. The 
present revised edition, issued to commemorate the 25th Anniversary of the Social 
Security Act, contains about 1,640 references on the subject, mostly new. The bib- 
liographv is intended to provide basic course materia] in the field, including legisla- 
tive reports and studies by agencies as well as books, articles and monographs on the 
subject. It is mostly annotated, though unfortunately a number of references are 
not annotated or are annotated in an incomplete manner. 

This bibliography could be of much greater use from a statistical point of view if 
it contained a section on sources of statistical data on social security. From a more 
general point of view, inclusion of a list of sources covered as well as mailing ad- 
dresses for these sources would also be of value. R.F. 


Population and Labor Force Projections in the United States, 1960-1975. U.S. Depart- 
ment of Labor, Bureau of Labor Statistics. Washington: U. 8. Government Printing Office, 
1959. Pp. iv, 56. $.40. Paper. 


roM the introduction: “This bulletin presents population and labor force projec- 

tions to 1975 and a detailed description of the method used in preparing the labor 
force projections. The estimates of the future population are one of a series published 
by the Bureau of the Census. . . . Because population projections are so fundamental 
to the projections of the labor force, an evaluation of the population trends is also 
included.” 

From the summary: 


“1. The population of the United States will probably reach about 226 million 
by 1975... 
2. Between 1955 and 1965 the labor force of the United States is expected to 
increase by 11 million, assuming favorable economic conditions prevail... 
3. Between 1965 and 1975, the labor force is expected to increase by 15 million 
persons... 
4. A large part of the increment to the labor force will consist of young people 
and married women. This ‘will mean an unusually large increase in the number 
of part time workers.’ ” W.G.M. 


Patterns of Industrial Growth 1938-1958. Statistical Office, United Nations. New York, 
New York: United Nations Department of Economic and Social Affairs, 1960. Pp. vii, 
471. $8.50. 


“rine dimensions and patterns of the material growth over the twenty years, 
1938-1958, in the industrial sector of countries of the world are measured and 
analysed in this publication for the first time. The distribution, according to areas of 
the world, stages of economic development and kinds of industry, of output and em- 
ployment in 1938, 1948, 1953 and 1958 is also delineated and the significant changes 
over these years, in the structure of industrial activity and resources are traced. 
“Part I is devoted to the presentation and analysis of internationally comparable 
data on the industries of mining and manufacturing, for the world as a whole and for 
countries grouped according to regions and degree of industrialization, concerning the: 


“Volume of production and employment, 
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“Level of output per person engaged and per head of population, 

“Distribution of production and employment by region and stage of economic 
development. 

“Part II consists of a chapter for each of the 75 countries or territories concerning 
which data have been compiled on the industries of mining, manufacturing, con- 
struction and electricity and gas that are as comparable internationally as feasible on: 


“Volume of production, 

“Value added, 

“Employment, in terms of both numbers of persons engaged and employees, 
“Wages and salaries paid, 

“Capacity of power equipment, 

“The number of industrial units. 


“The International Standard Industrial Classification has been utilized in classify- 
ing the data shown in this publication according to kind of industrial activity.” 

Included in this publication are three appendices. Appendix I covers the definitions 
of the concepts and items of data that are included. Appehdix II discusses the index 
numbers of the volume of industrial production, and Appendix III deals with the 


estimates of value added in current prices, and includes also certain related infor- 
mation. W.G.M. 


Yearbook of International Trade Statistics, 1958. Statistical Office, United Nations. 1959, 
Volume I, Pp. 157, $6.00. Volume II, Pp. 167, $2.00. Paper. 


T= ninth issue of this Yearbook is divided into two volumes. The first volume 
presents detailed data for individual countries giving the following information: 
total exports and imports annually from 1958 back to 1930 where possible, imports 
and exports by the Standard International Trade Classification code annually for 
1955-1958, and imports from and exports to principal countries annually for 1954- 
1958. It also contains summary tables on world trade by country for 1938, 1948, and 
1953-1958, world movement of crude petroleum and of coal and lignite, as well as 
price indices of goods in foreign trade. 

The second volume is essentially a summary volume. In addition to four summary 
tables on world exports and imports by country and commodity class, it presents a 
detailed tabulation indicating the flow of exports and imports for 1955-58 from each 
country to all other countries for individual commodity classes. R.F. 


The Materials of Demography: A Selected and Annotated Bibliography. Hope T'.. Eldridge. 
New York: Columbia University Press, 1959. Pp. xi, 222. No price listed. 


HIS volume is designed to help meet the need of teachers, students, and researchers 
for a selective bibliography of demographic literature in English. The survey 
covers the literature published prior to 1958, but because of the inevitable delays 
between compilation and publication, coverage of the more recent literature is in- 
complete. 
Comparable bibliographies of demographic literature in French, Italian, and 
Spanish are also expected. 
The section headings provide an indication of the book’s contents: 


I. General Works in Demography 
II. Population Problems and Theories 
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. General Demographic Study of Specific Populations 

. Problems and Methods of Demographic Analyses 

. Studies of Fertility, Marriage and Reproduction 

. Studies of Mortality and Morbidity 

. Studies of Population Distribution and Migration 

. Studies of Manpower and the Labor Force 

. Studies and Reports on Legislative Measures Covering Administrative 
Programs and Policies Affecting Population 

. Statistical Compendia and Bibliographies. W.G.M. 
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